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a base sequence coding for a polypeptide having an 
enzymatic activity of converting the 4-methylene group 
of p-ionone ring into a keto group; a DNA chain having 
a base sequence coding for a polypeptide having an 
enzymatic activity of converting the 4-methylene group 
of a 3-hydroxy-p-ionone ring into a keto group; a DNA 
chain having a base sequence coding for a polypeptide 
having an enzymatic activity of adding a hydroxyl group 
to the 3-carbon atom of a 4-keto-p-ionone ring; and a 
process for producing various xanthophylls, such a can- 
thaxanthin and astaxanthin, by introducing the above 
DNA chain(s) into a suitable microorganism, e.g., 
Escherichia coli . followed by expression thereof. 
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Description 

Technical Field 

The present invention relates to DNA strands useful for the synthesis of keto group-containing xanthophylls (keto- 
carotenoids) such as astaxanthin which are useful for heightening the color of cultured fishes and shellfishes such as 
sea breams, salmons, lobster and the like and is used for foods as a coloring agent and an antioxidant, and to a process 
for producing keto group-containing xanthophylls (ketocarotinoids) such as astaxanthin with use of a microorganism 
into which the DNA strands have been introduced. 

Background Art 



The term xanthophylls mean carotenoid pigments having an oxygen-containing group such as a hydroxyl group, a 
keto group or an epoxy group. Carotenoids are synthesized by the isoprenoid biosynthetic process which is used in 

75 common halfway with steroids and other terpenoids with mevalonic acid as a starting material. C15 farnesyl pyrophos- 
phate (FPP) resulting from isoprene basic biosynthetic pathway is condensed with C5 isopentenyl pyrophosphate (IPP) 
to give C20 geranylgeranyl pyrophosphate (GGPP). Two molecules of GGPP are condensed to synthesize a colorless 
phytoene as an initial carotenoid. The phytoene is converted into phytof luene, ^-carotene, neurosporene and then lyc- 
opene by a series of desaturation reactions, and lycopene is in turn converted into p-carotene by the cyclization reac- 

20 tion. It is believed that a variety of xanthophylls are synthesized by introducing a hydroxyl group or a keto group into the 
p-carotene (See Britton, G., "Biosynthesis of Carotenoids"; Plant Pigments, Goodwin, T.W. ed., London, Academic 
Press, 1988, pp. 133-182). 

The present inventors have recently made it possible to clone a carotenoid biosynthesis gene cluster from a epi- 
phytic non-photosynthetic bacterium Erwinia uredovora in Escherichia coli with an index of the yellow tone of the bac- 

25 terium, a variety of combinations of the genes being expressed in microorganisms such as Escherichia CQli to produce 
phytoene. lycopene. p-carotene. and zeaxanthin which is a derivative of p-carotene into which hydroxyl groups have 
been introduced (See Fig. 10; Misawa. N., Nakagawa. M.. Kobayashi, K., Yamano, S.. Izawa. Y. Nakamura, K., Har- 
ashima. K.; "Elucidation of the Erwinia uredovora Carotenoid biosynthetic Pathway by Functional Analysis of Gene 
Products Expressed in Escherichia coli ". J. Bacteriol., 172, p.6704-6712, 1990; Misawa. N., Yamano. S., Ikenaga, H., 

30 "Production of p-carotene in Zvmomonas mobilis and Aorobacterium tumefaciencs by Introduction of the Biosynthesis 
Genes from Erwinia uredovora ". Appl. environ. Microbiol., 57, p. 1847-1849, 1991; and Japanese Patent Application 
No. 58786/1991 (Japanese Patent Application No. 53255/1990): "DNA Strands useful for the Synthesis of Caroten- 
oids"). 

On the other hand, astaxanthin, a red xanthophyll, is a typical animal carotenoid which occurs particularly in a wide 

35 variety of marine animals including red fishes such as a sea bream and a salmon, and crustaceans such as a crab and 
a lobster. In general, animals cannot biosynthesize carotenoids, so that it is necessary for them to ingest carotenoids 
synthesized by microorganisms or plants from their environments. Thus, astaxanthin has hitherto been used widely for 
strengthening the color of cultured fishes and shellfishes such as a sea bream, a salmon, a lobster and the like. More- 
over, astaxanthin has attracted attention not only as a coloring matter in foods but also as an arrti-oxidant for removing 

40 active oxygen generated in bodies, which causes carcinoma (see Takao Matsuno ed., "Physiological Functions and 
Bioactivities of Carotenoids in Animals". Kagaku to Seibutsu, 28, p. 219-227, 1990). As the sources of astaxanthin, 
there have been known crustaceans such as a krill in the Antarctic Ocean, cultured products of a yeast Pnaffia. cultured 
products of a green alga Haematococcus. and products obtained by the organic synthetic methods. However, 
crustaceans such as a krill in the Antarctic Ocean or the like are used, it requires laborious works and much expe? : s 

45 for the isolation of astaxantin from contaminants such as lipids and the like during the harvesting and extraction ot r»3 
krill. Moreover, in the case of the cultured product of the yeast Pnaffia . a great deal of expenses are required for the 
gathering and extraction of astaxanthin, since the yeast has rigid cell walls and produces astaxanthin only in a low yield. 
Also, in the case of the cultured product of the green alga Haematococcus . not only a location for collecting sunlight or 
an investment of a culturing apparatus for supplying an artificial light is required in order to supply light which is essen- 

50 tial to the synthesis of astaxantin, but also it is difficult to separate astaxanthin from fatty acid esters as by-products or 
chlorophylls present in the cultured products. From these reasons, astaxanthin produced from biological sources is in 
the present situation inferior to that obtained by the organic synthetic methods on the basis of cost. The organic syn- 
thetic methods however have a problem of by-products produced during the reactions in consideration of its use as a 
feed for fishes and shellfishes and an additive to foods, and the products obtained by the organic synthetic methods are 

55 opposed to the consumer's preference for natural products. Thus, it has been desired to supply an inexpensive astax- 
anthin which is safe and produced from biological sources and thus has a good image to consumers and to develop a 
process for producing the astaxanthin. 
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Disclosure nf the Invention 

It would be considered very useful to find a group of genes for playing a role of the biosynthesis of astaxanthin, 
because it is possible to afford astaxanthin-producing ability to a microorganism optimum in safety as a food or in poten- 
tiality for producing astaxanthin, regardless of the presence of astaxanthin-producing ability, by introducing a gene clus- 
ter for astaxanthin biosynthesis into the microorganism. No problem of by-products as contaminants is caused in this 
case so that it would be considered not so difficult to increase the production amount of astaxanthin with a recent 
advanced technique of gene manipulation to a level higher than that accomplished by the organic synthetic methods. 
However the groups of genes for synthesizing zeaxanthin, one of the xanthophylls, have already been acquired by the # 
present inventors as described above, while no genes encoding a keto group-introducing enzyme required for the syn- 
thesis of astaxanthin have not successfully obtained. The reason of the failure in obtaining the genes includes that the 
keto group-introducing enzyme is a membrane protein and loses its activity when isolated from the membrane, so that 
it was impossible to purify the enzyme or measure its activity and no information on the enzyme has been obtained. 
Thus it has hitherto been impossible to produce astaxanthin in microorganisms by gene manipulation. 

The object of the present invention is to provide DNA strands which contain genes required for producing keto 
group-containing xanthophylls (ketocarotenoids) such as astaxanthin in microorganisms by obtaining such genes cod- 
ing for enzymes such as a keto group-introducing enzyme required for producing keto group-containing xanthophylls 
(ketocarotenoids) such as astaxanthin, and to provide a process for producing keto group-containing xanthophylls 
(ketocarotenoids) such as astaxanthin with the microorganisms into which the DNA strands have been introduced. 

The gene cloning method which is often used usually comprising purifying the aimed protein, partially determining 
the amino acid sequence and obtaining genes by a synthetic probe cannot be employed because of the purification of 
the astaxanthin synthetic enzyme being impossible as described above. Thus, the present inventors have paid attention 
to the fact that the cluster of carotenoid synthesis genes in non-photosynthetic bacterium ( Erwiniq) functions in 
Escherichia coli in which lycopene and p-carotene which are believed to be intermediates for biosynthesis of astaxan- 
thin are allowed to produce with combinations of the genes from the gene cluster, and have used Escherichia coli as a 
host for cloning of astaxanthin synthetic genes. The present inventors have also paid attention to the facts that some 
marine bacteria have an astaxanthin-producing ability (Yokoyama, A., Izumida, H., Miki, W.. "Marine bacteria produced 
astaxanthin" 10th International Symposium on Carotenoids. Abstract. CL11-3. 1993). that a series of related genes 
would constitute a cluster in the case of bacteria, and that the gene cluster would be expressed functionally in 
Escherichia coli in the case of bacteria. The present inventors have thus selected the marine bacteria as the gene 
sources They have carried out researches with a combination of these two means and successfully obtained the gene 
group which is required for the biosynthesis of astaxanthin and the other keto group-containing xanthophylls from 
marine bacteria. They have thus accomplished the present invention. In addition, rt has been first elucidated in the 
present invention that the astaxanthin synthesis gene cluster in marine bacteria constitutes a cluster and expresses its 
function in Escherichia coli . and these gene products can utilize p-carotene or lycopene as a substrate. 

The DNA strands according to the present invention are set forth as follows. 

(1) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting the methylene group at the 4-position of the p-ionone ring into a keto group. 

(2) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting the methylene group at the 4-position of the p-ionone ring into a keto group and having an ammo aad 
sequence substantially of amino acid Nos. 1 - 212 which is shown in the SEQ ID NO: 1 

(3) A DNA strand hybridising the DNA strand described in (2) and having a nucleotide sequence which encodes a 
polypeptide having an enzyme activity described in (2). 

(4) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting the methylene group at the 4-position of the p-ionone ring into a keto group and having an amino aad 
sequence substantially of amino acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

(5) A DNA strand hybridizing the DNA strand described in (4) and having a nucleotide sequence which encodes a 
polypeptide having an enzyme activity described in (4). 

(6) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting p-carotene into canthaxanthin via echinenone and having an amino acid sequence substantially of amino 
acid Nos. 1-212 which is shown in the SEQ ID NO: 1 

(7) A DNA strand hybridizing the DNA strand described in (6) and having a nucleotide sequence which encodes a 
polypeptide having an enzyme activity described in (6). 

(8) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting p-carotene into canthaxanthin via echinenone and having an amino acid sequence substantially of amino 
acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

(9) A DNA strand hybridizing the DNA strand described in (8) and having a nucleotide sequence which encodes a 
polypeptide having an enzyme activity described in (8). 
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(10) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting the methylene group at the 4-position of the 3-hydroxy-p-ionone ring into a keto group. 

(11) a DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting the methylene group at the 4-position of the 3-hydroxy-p-ionone ring into a keto group and having an amino 
acid sequence substantially of amino acid Nos. 1-212 which is shown in the SEQ ID NO: 1. 

(12) A DNA strand hybridizing the DNA strand described in (1 1) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (1 1). 

(13) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting the methylene group at the 4-position of the 3-hydroxy-p-ionone ring into a keto group and having an amino 
acid sequence substantially of amino acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

(14) A DNA strand hybridizing the DNA strand described in (1 3) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (13). 

(15) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting zeaxanthin into astaxanthin by way of 4-ketozeaxanthin and having an amino acid sequence substantially 
of amino acid Nos. 1 - 212 which is shown in the SEQ ID NO: 1. 

(16) A DNA strand hybridizing the DNA strand described in (15) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (15). 

(17) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting zeaxanthin into astaxanthin by way of 4-ketozeaxanthin and having an amino acid sequence substantially 
of amino acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

(18) A DNA strand hybridizing the DNA strand described in (17) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (17). 

(19) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for add- 
ing a hydroxyl group to the 3-carbon of the 4-keto-p-ionone ring. 

(20) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for add- 
ing a hydroxyl group to position 3-carbon of the 4-keto-p-ionone ring and having an amino acid sequence substan- 
tially of amino acid Nos. 1-162 which is shown in the SEQ ID NO: 2. 

(21) A DNA strand hybridizing the DNA strand described in (20) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (20). 

(22) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for add- 
ing a hydroxyl group to position 3-carbon of the 4-keto-p-ionone ring and having an amino acid sequence substan- 
tially of amino acid Nos. 1 - 162 which is shown in the SEQ ID NO: 6. 

(23) A DNA strand hybridizing the DNA strand described in (22) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (22). 

(24) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting canthaxanthin into astaxanthin by way of phoenicoxanthin and having an amino acid sequence substantially 
of amino acid Nos. 1-162 which is shown in the SEQ ID NO: 2. 

(25) A DNA strand hybridizing the DNA strand described in (24) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (24). 

(26) A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for con- 
verting canthaxanthin into astaxanthin by way of phoenicoxanthin and having an amino acid sequence substantially 
of amino acid Nos. 1-162 which is shown in the SEQ ID NO: 6. 

(27) A DNA strand hybridizing the DNA strand described in (26) and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity described in (26). 

The present invention also relates to a process for producing xanthophylls. 

That is, the process for producing xanthophylls according to the present invention is set forth below. 

(1) A process for producing a xanthophyll comprising introducing the DNA strand described in any one of the above 
mentioned DNA strands (1) - (9) into a microorganism having a p-carotene-synthesizing ability, culturing the trans- 
formed microorganism in a culture medium, and obtaining canthaxanthin or echinenone from the cultured cells. 

(2) A process for producing a xanthophyll comprising introducing the DNA strand described in any one of the above 
mentioned DNA strands (10) - (18) into a microorganism having a zeaxanthin-synthesizing ability, culturing the 
transformed microorganism in a culture mecfium. and obtaining astaxanthin or 4-ketozeaxanthin from the cultured 
cells. 

(3) A process for producing a xanthophyll comprising introducing the DNA strand described in any one of the above 
mentioned DNA strands (19) - (27) into a microorganism having a canthaxanthin-synthesizing ability, culturing the 
transformed microorganism in a culture medium, and obtaining astaxanthin or phoenicoxanthin from the cultured 
cells. 



4 



07351 37A1_I_> 



10 



15 



EP 0 735 137 A1 

(4) A process for producing a xanthophyll according to any one of the above mentioned processes (1 ) - (3), wherein 
the microorganism is a bacterium or yeast. 

Rriof n^rriptinn of the Drawings 

Fia 1 illustrates diagrammatically the nucleotide sequence of the keto group-introducing enzyme gene (erOV 
gene) of the marine bacterium AnmhArturium aurantiacus sp. nov. MK1 and the amino acid sequence of a polypept.de 

to be F ® nc 2 ^^ r g^ b ^ agrammatica ,| y the nucleotide sequence of the hydroxyl group-introducing enzyme gene (crt_Z 
gene) of the marine bacterium Ajgfibastgtium aurantiacus sp. nov. MK1 and the amino acid sequence of a polypept.de 

to be encoded thereby. . . . 

Fig 3 illustrates diagrammatically the nucleotide sequence of the lycopene-cycl.zing enzyme gene (crtY gene) of 
the marine bacterium Aafflbjjfleoum aurantiacus sp. nov. MK1 and the amino acid sequence of a polypept.de to be 
encoded thereby 

Fig 4 illustrates diagrammatically the continuation of the sequences following to those illustrated in Fig. 3. 
F\g. 5 illustrales diagrammatically the nucleotide sequence of the xanthophyll synthesis gene cluster of the mar.ne 
bacterium Agrobacter ium aurantiacus sp. nov. MK1. 

The letters A - F in Fig. 5 correspond to those in Figs. 1-4. 

Fig 6 illustrates diagramatically the continuation of the sequence following to that .llustrated in Fig. 5. 
Fig 7 illustrates diagrammatically the continuation of the sequence following to that illustrated in Fig. 6. 
Fig 8 illustrates diagrammatically the continuation of the sequence following to that illustrated in Fig. 7. 
Fig 9 illustrates diagrammatically the continuation of the sequence following to that illustrated in Fig. 8. 
Fig. 10 illustrates diagrammatically the carotenoid biosynthetic route of the non-photosynthes.s bacterium Erwmia 
uredovora and the functions of the carotenoid synthetic genes. 

Fig"" 11 illustrates diagrammatically the main xanthophyll biosynthetic routes of marine bacteria Aqrobacter.um 
aurantiacus sp. nov. MK1 and Alcaliaenes sp. PC-1 and the functions of the xanthophyll synthesis genes. 
The function of criY gene, however, has been confirmed only in the former bacterium. 

Fig. 12 illustrates diagrammatically a variety of deletion plasmids containing the xanthophyll synthesis genes (clus- 
ter) of the marine bacterium Aorobacteri um aurantiacus sp. nov. MK1 . ,.•«.«.• 

The letter P represents the promoter of the lag of the vector pBluescript II SK. The positions of cutting with restric- 
tion enzymes are represented by abbreviations as follows: Sa. Sacl; X. Xbal; B, BamHI; P, Est); E. EcaRI; S. Sail; A. 
Apal; K, Kpnl; St. Sjul; N. Nrul; Bg. BgJII; Nc. Ncgl; He. HJncll. 

Fig 13 illustrates diagrammatically the nucleotide sequence of the keto group-.ntroduc.ng enzyme gene (crM 
gene) of the marine bacterium Alcaliaenes sp. PC-1 and the amino acid sequence of a polypeptide to be encoded 
35 thereby 

Fig 1 4 illustrates diagrammatically the continuation of the sequences following to those illustrated in Fig. 13. 

Fig 15 illustrates diagrammatically the nucleotide sequence of the hydroxyl group-introducing enzyme gene (er£ 
gene) of the marine bacterium Alcaliaenes sp. PC-1 and the amino acid sequence of a polypeptide to be encoded 
thereby 

Fig' 1 6 illustrates diagrammatically the nucleotide sequence of the xanthophyll synthetic gene cluster of the marine 
bacterium Alcaliaenes sp. PC-1 and the amino acid sequence of a polypeptide to be encoded thereby. The letters A -D 
in Fig. 16 correspond to those in Figs. 13 - 15. . 

Fig 17 illustrates diagrammatically the continuation of the sequences following to those illustrated in Fig. 16. 
Fig 18 illustrates diagrammatically the continuation of the sequences following to those illustrated in Fig. 17 
Fig. 19 illustrates diagrammatically a variety of deletion plasmids containing the xanthophyll synthetic genes (clus- 
ter) of the marine bacterium Alcaligenes sp. PC-1 . 

The letter P represents the promoter of the iac of the vector pBluescript II SK+. 

Fia 20 illustrates diagrammatically xanthophyll biosynthetic routes containing miner biosynthetic routes in the 
marine bacteria A gmbactenum aurantiacus sp. no. MK1 and Alcaliqenes sp. PC-1 and the functions of the xanthophyll 

so synthesis genes. 

Miner biosynthetic routes are represented by dotted arrows. 

Best Mode for carrying ou f the Invention 

The present invention is intended to provide DNA strands which are useful for synthesizinga keto group-containing 
xanthophylls (ketocarotenoids) such as astaxanthin derived from a marine bacteria Aqrobacterium aurantiacus sp nov. 
MK1 and Alcaliaenes sp. PC-1. and a process for producing keto group-containing xanthophylls (ketocarotenoids), i.e. 
astaxanthin, phoenicoxanthin, 4-ketozeaxanthin, canthaxanthin, and echinenone with use of a microorganism into 
which the DNA strands have been introduced. 
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The DNA strands according to the present invention are in principle illustrated generally by the aforementioned 
D' strands (1), (10) and (19) from the standpoint of the fine chemical-generating reaction, and basically defined by 
tru rcrementioned DNA strands (2), (4), (11), (13), (20) and (22). The specific examples of the DNA strands (2) and 
(4; are the aforementioned DNA strands (6) and (8); the specific examples of the DNA strands (11) and (13) are the 
5 aforementioned DNA strands (1 5) and (1 7); and the specific examples of the DNA strands (20) and (22) are the afore- 
mentioned DNA strands (24) and (26). In this connection, the DNA strands (3), (5), (7), (9). (12), (14), (16), (18), (21), 
(23), (25) and (27) hybridize the DNA strands (2), (4), (6), (8), (1 1), (13), (15), (17), (20), (22), (24) and (26), respectively, 
under a stringent condition. 

The polypeptides encoded by the DNA strands according to the present invention have amino acid sequences sub- 
io stantially in a specific range as described above in SEQ ID NOS: 1 - 2, and 5 - 6 (Figs. 1 - 2, and 13 - 15), e.g. an amino 
acid sequence of ammo acid Nos. 1 - 212 in SEQ ID NOS: 1 (A - B in Fig. 1). In the present invention, four polypeptides 
encoded by these DNA strands, that is four enzymes participating in the xanthophyll -producing reaction) may be mod- 
ified by deletion, substitution or addition in some of the amino acids provided that the polypeptides have the enzyme 

activities as described above (see Example 13). This corresponds to that "amino acid sequences substantially ..." 

15 For instance, an enzyme of which amino acid at the first position (Met) has been deleted is also involved in the polypep- 
tide or enzyme obtained by the modification of the amino acid sequence. In this connection, it is needless to say that 
the DNA strands according to the present invention for encoding the polypeptides also include, in addition to those hav- 
ing nucleotide sequences in a specific range shown in SEQ ID NOS: 1 - 2, and 13-15 (Figs. 1 - 2, and 13 - 15). degen- 
erate isomers encoding the same polypeptides as above except degenerate codons. 



Keto group-introducing enzyme aene (crtW) 

The DNA strands (1 ) - ( 1 8) are genes which encode the keto group-introducing enzymes (referred to hereinafter as 
ertW) . Typical examples of the genes are crtW genes cloned from the marine bacteria Aarobacterium aurantiacus sp. 

25 nov. MK1 or Alcaliaenes sp PC-1. which are the DNA strands comprising the nucleotide sequences encoding the 
polypeptides having the amino acid sequences A - B in Fig. 1 (amino acid Nos. 1 - 212 in SEQ ID NOS: 1) or A - B in 
Figs. 13-14 (amino acid Nos. 1 - 242 in SEQ ID NOS: 5). The crtW gene product (also referred to hereinafter as CrtW) 
has an enzyme activity for converting the 4-methylene group of the p-ionone ring into a keto group, and one of the spe- 
cific examples is an enzyme activity for synthesizing canthaxanthin with p-carotene as a substrate by way of 

30 echinenone (see Fig. 11). In addition, the crtW gene product also has an enzyme activity for converting the 4-methylene 
group of the 3-hydroxy-p-ionone ring into a keto group, and one of the specific examples is an enzyme activity for syn- 
thesizing astaxanthin with zeaxanthin as a substrate by way of 4-ketozeaxanthin (see Fig. 11). In this connection, the 
polypeptides having such enzyme activities and the DNA strands encoding the polypeptides have not hitherto been 
reported, and the polypeptides or the DNA strands encoding the polypeptides has no overall homology to polypeptides 

35 or DNA strands which have hitherto been reported. Moreover, no such information has been reported that one enzyme 
has an activity to convert directly a dihydrocarbonyl group of not only the p-ionone ring and the 3-hydroxy-p-ionone ring 
but also the other compounds into a keto group. Moreover, a homology of CrtW as high as 83% identity at an amino 
acid sequence level was shown between Aarobacterium and Alcaliaenes . 

On the other hand, it is possible to allow a microorganisms such as Escherichia coli or the like to produce p-caro- 

40 tene or zeaxanthin by using the carotenoid synthesis genes of the non-photosynthetic bacterium Erwinia. that is the 
crtE . crtB. crtl and crtY genes of Erwinia afford the microorganism such as Escherichia coli or the like the p-carotene- 
producing ability, and the crtE . crtB. crtl . crtY and crtZ genes of Erwinia afford the microorganisms such as Escherichia 
coli or the like the zeaxanth in-producing ability (see Fig. 10 and Laid-Open Publication of WO91/13078). Thus, the sub- 
strate of CrtW is supplied by the grt gene cluster of Erwinia . so that when additional crtW gene is introduced into the 

45 microorganism such as Escherichia coli or the like which contains the aforementioned crt gene cluster of Erwinia . the 
p-carotene-producing microorganism will produce canthaxanthin by way of echinenone, and the zeaxanthin-producing 
microorganism will produce astaxanthin by way of 4-ketozeaxarrthin. 

Hydroxvl group-introducing enzyme aene (crU) 



The DNA strands (19) - (27) are genes encoding a hydroxy! group-introducing enzyme (referred to hereinafter as 
crtZ) . Typical examples of the genes are £dZ genes cloned from the marine bacteria Aarobacterium aurantiacus sp. 
nov. MK1 or Alcaliaenes sp. PC-1 , which are the DNA strands comprising the nucleotide sequences encoding the 
polypeptides having the amino acid sequences C - D in Fig. 2 (amino acid Nos. 1 - 162 in SEQ ID NOS: 2) or C - D in 
55 Figs. 15 (amino acid Nos. 1 - 162 in SEQ ID NOS: 6). The crtZ gene product (also referred to hereinafter as CrtZ) has 
an enzyme activity for adding a hydroxy! group to the 3-carbon atom of the p-ionone ring, and one of the specific exam- 
ples is an enzyme activity for synthesizing zeaxanthin with use of p-carotene as a substrate by way of p-cryptoxanthin 
(see Fig. 1 1). In addition, the crtZ gene product also has an enzyme activity for adding a hydroxyl group to the 3-carbon 
atom of the 4-keto-p-ionone ring, and one of the specific examples is an enzyme activity for synthesizing astaxanthin 
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with canthaxanthin as a substrate by way of phoenicoxanthin (see Fig. 1 1). In ^^^ r ^ 
the latter enzyme activity and the DNA strand encoding the polypeptide have not hrtherto been reported ^reover. 
CrtZ of Aorotecterium and Alcalioenes showed a high homology with CrtZ of Frwin.3 uredovora (57% and 58% iden- 
tity), respectivelyfat an amino acid sequence level. Also, a high homology of 90% identity at an amino acid sequence 
level was shown between the CrtZ of Aarobacterium and Alcaliqenes. 

It has been described above that it is possible to allow a microorganism such as Escherichia coh or the like to pro- 
duce p-carotene by using the carotenoid synthetic genes of the non-photosynthetic bacterium Eon 
been described above that it is possible to allow a microorganism such as Fschenche col, or the like o produce can- 
thaxanthin by adding crtW thereto. Thus, the substrate of CrtZ of Aqrobacterium or Alcaliqenes is supplied by the crtE, 
cS Sfl and erJY gM of Erwinia (production of p-carotene), and the crJW gene of Agrpbactengm or Alaligspes 
^edlhereto, so that when the crJZ gene of Aqrpb^erigm or Alcaliqenes is introduced into a micr ™!^ 
Fscherichiacoli or the like containing the crt gene group, the p^arotene-producing microorganism wiH^ 
anthin by way of p-cryptoxanthin, and the canthaxanthin-producing microorganism will produce astaxanth.n by way of 
phoenicoxanthin. 

1 ycopene-cynlizino en zyme nene (crtY) 

The DNA strand encoding the amino acid sequence substantially from E to F of Figs. 3 and 4 (amine > acid Nos. 1 - 
386 in SEQ ID NO: 3) is a gene encoding a lycopene-cyclizing enzyme (referred to hereinafter as crtY). A typical exam- 
ple of the gene is the crtY gene cloned from the marine bacterium Anrohacterium aurant.acus sp. nov. MK1 , which is 
The DNA strand comprising the nucleotide sequence encoding the polypeptide having the amino acid sequence E - F 
in Figs 3 and 4. The crjY gene product (also referred to hereinafter as CrtY) has an enzyme activity for synthesizing p- 
carotene with lycopene as a substrate (see Fig. 11). It is possible to allow a microorganism such as Escherichia Wh or 
Eke to produce lycopene by using a carotenoid biosynthesis genes of a non-photosynthetic bacterium Ebm& that 
s the crtE crtB and crtl genes ol Erwinia give a microorganism such as Escherichia coli or the like a lycopene b.osyn- 
thesis^billtyTTee Fig~0. and Laid-Open Publication of WO91/13078). Thus, the substrate of the CrtY of Aqrobye- 
^is^ed by the crt gene group of Bwinia. so that when the crtY « *nt**W ^^'Zn 
n^oorganism such as F^herichia coli or the like containing the crl gene group, it is possible to allow the m.croorgan- 

E S^rSSX CrtY of Anrobacterium has a sign. icant homology of 44.3*/o identity to the CrtY of Er*irM 
uredovora at the amino acid sequence level, and these CrtY enzymes also have the same enzymatic funct.on (see Figs. 
10 and 11). 

Bacteriologic al properties of marine bacteria 

Thr marine bacteria Ag rnharterium aurantiacus so. nov. MK1 and Alcaliqenes sp. PC-1 as the sources of the xan- 
thophyll synthetic genes show the following bacteriological properties. 

(Aarobacterium aurantiacus sp. nov. MK1 ) 

40 

(1) Morphology 

Form and size of bacterium: rod, 0.9 urn x 1 .2 um; 
Motility: yes; 

Flagellum: peripheric flagellum; 
45 Polymorphism of cell: none; 
Sporogenesis: none; 
Gram staining: negative. 
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(2) Growths in culture media 

Broth agar plate culture: non-diffusive circular orange colonies having a gloss are formed. 
Broth agar slant culture: a non-diffusive orange band having a gloss is formed. 
Broth liquid culture: homogeneous growth all over the culture medium with a color in orange. 
Broth gelatin stab culture: growth over the surface around the stab pore. 



55 (3) Physiological properties 
Reduction of nitrate: positive: 
Denitrification reaction: negative; 
Formation of indole: negative; 
Utilization of citric acid: negative; 
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Formation of pigments: fat-soluble reddish orange pigment; 

Urease activity: negative; 

Oxidase activity: positive; 

Catalase activity: positive; 
5 p-Glucosidase activity (esculin degradability): positive; 

p-Galactosidase activity: positive; 

Growth range: pH, 5 - 9; temperature, 10 - 40°C: 

Behavior towards oxygen: aerobic; 

Durability to seawater: positive; 
70 O - F test: oxidation; 

Anabolic ability of saccharides: 

Positive D-glucose, D-mannose, D-galactose, D-fructose, lactose, maltose, sucrose, glycogen, N-acetyl-D-glu- 

cosamme. 

Negative: L-arabinose. D-mannrtol, inositol, L-rhamnose, D-sorbitol; 
75 Anabolic ability of organic acids: 
Positive lactate; 

Negative: citrate, malate, gluconate, caprinate, succinate, adipate; 
Anabolic ability of the other organic materials: 

Positive inosine, uridine, glucose-1 -phosphate, glucose-6-phosphate; 
20 Negative: gelatin. L-arginine, DNA, casein. 

<A»cahggnes sp. PC-1> 

(1) Morphology 

25 Form and size of bacterium: short rod, 1.4 pm; 
Motility: yes; 

Flagellum: peripheric flagellum; 
Polymorphism of cell: none; 
Sporogenesis: none; 
30 Gram staining: negative. 

(2) Growths in culture media 

Broth agar plate culture: non-diffusive circular orange colonies having a gloss are formed. 
Broth agar slant culture: a non-diffusive orange band having a gloss is formed. 
35 Broth liquid culture: homogeneous growth all over the culture medium with a color in orange. 
Broth gelatin stab culture: growth over the surface around the stab pore. 

(3) Physiological properties 

Formation of pigments: fat-soluble reddish orange pigment; 
40 Oxidase activity: positive; 
Catalase activity: positive; 

Growth range: pH, 5 - 9; temperature, 10 - 40°C: 
Behavior towards oxygen: aerobic; 
Durability to seawater: positive; 
45 O - F test: oxidation; 

Degradability of gelatin: negative. 

XanthoDhvll synthetic gene duster of the other mari ne bacteria 

so It has hitherto been reported that 16 marine bacteria have an ability to synthesize ketocarotenoids such as astax- 

anthin and the like (Yokoyama, A., Izumida, H., Miki, W., "Marine bacteria produced astaxanthin", 10th International 
Symposium on Carotenoids. Abstract. CL11-3. 1993). If either of the ol genes of the aforementioned marine bacteria 
Aorobacterium aurantiacus sp. nov. MK-1 or Alcalioenes sp. PC-1 is used as a probe, the gene cluster playing a role of 
the biosynthesis of ketocarotenoids such as astaxanthin and the like should be obtained from the other astaxanthin pro- 

55 ducing marine bacteria by using the homology of the genes. In fact, the present inventors have successfully obtained 
the crtW and crtZ genes as the strongly hybridizing DNA fragments from the chromosomal DNA of Alcalioenes PC-1 
with use of a DNA fragment containing crtW and crtZ of Ao. aurantiacus sp. nov. MK1 as a probe (see Examples as for 
the details). Furthermore, when Arteromonas SD-402 was selected from the remaining 14 marine bacteria having an 
astaxanthin synthetic ability and a chromosomal DNA was prepared therewith and subjected to the Southern hybridi- 
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zation experiment with a DNA fragment containing crtW and crtZ of Aq. aurantiacus sp. nov. MK1 . the probe hybridized 
with the bands derived from the chromosomal DNA of the marine bacteria. The DNA strands according to the present 
invention also includes a DNA strand which hybridizes with the DNA strands (2), (4). (6). (8). (1 1). (13). (15). (17). (20). 
(22). (24) and (26). 

Arg! nsitinn of DNA strands 

Although one of the methods for obtaining the DNA strand having a nucleotide sequence which encodes the amino 
acid sequence of each enzyme described above is to chemically synthesize at least a part of the strand length accord- 
ing to the method for synthesizing a nucleic acid, it is believed more preferable than the chemical synthetic method to 
obtain the DNA strand by using the total DNA having been digested with an appropriate restriction enzyme to prepare 
a library in Escherichia coli . from which library the DNA strand is obtained by the methods conventionally used in the 
art of genetic engineering such as a hybridization method with an appropriate probe (see the xanthophyll synthetic 
gene cluster of the other marine bacteria). 

Tra nsformation of an microorganis m such as Escherichia coli and gene expression 

A variety of xanthophylls can be prepared by introducing the present DNA strands described above into appropriate 
microorganisms such as bacteria, for example Escherichia coli. Zymomonas mobilis and Aomhacterium tumefaaens . 
so and yeasts for sample Saccharo mvces cerivisiae. 

The outline for introducing an foreign gene into a preferred microorganism is described below. 
The procedure or method for introducing and expressing the foreign gene in a microorganism such as Escherichia 
coli or the like comprises the ones usually used in the art of genetic engineering in addition to those described below in 
the" present invention and may be carried out according to the procedure or method (see. e.g. "Vectors tor Cloning 
Genes". Methods in Enzymology. 216. p. 469-631. 1992. Academic Press, and "Other Bacterial Systems , Methods in 
Enzymology. 204. p. 305-636. 1991 . Academic Press). 

Escherichia coli ) 
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The method for introducing foreign genes into Escherichia coli includes several efficient methods such as the Hana- 
han's method and the rubidium method, and the foreign genes may be introduced according to these methods (see. for 
example Sambrook. J.. Fritsch. E.F.. Maniatjs, T. "Molecular Cloning - A Laboratory Manual". Cold Spring Harbor ^Lab- 
oratory Press 1 989). While foreign genes in Escherichia coli may be expressed according to the conventional methods 
(see for exarnple, "Molecular Cloning - A Laboratory Manual"), the expression can be carried out for example with a 
vector for Escherichia coli having a lac promoter in the pUC or pBluescript series. The present inventors have used a 
vector pBluescrip II SK or KS for Escherichia coli having a lac promoter and the like to insert the c^tL erJZ and effir 
genes of AnrohartPrium aurantiacus sp. nov. MK1 and the crtW and crtZ genes of Alcaliqenes sp. PC-1 and allowed to 
express these genes in Escherichia coli. 



40 <Yeast> 
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The method for introducing foreign genes into yeast Saocharomyces cerivisiae includes the methods which have 
already been established such as the lithium method and the like, and the introduction may be carried out according to 
these methods (see, for example. Ed. Yuichi AWyama. compiled by Bio-industry Association. "New B.otechnology of 
Yeast" published by IGAKU SHUPPAN CENTER). Foreign genes can be expressed in yeast by using a promoter and 
a terminator such as PGK and GPD to construct an expression cassette in which the foreign gene is inserted between 
the promoter and the terminator so that transcription is led through, and inserting the expression cassette into a vector 
such as the YRp system which is a multi-copy vector for yeast having the ARS sequence of the yeast chromosome as 
the replication origin, the YEp system which is a multi-copy vector for yeast having the replication origin of the 2 urn 
DNA of yeast and the Yip system which is a vector for integrating a yeast chromosome having no replication origin i of 
yeast (see "New Biotechnology of Yeast", published by IGAKU SHUPPAN CENTER, ibid, NIPPON NOGEI-KAGAKU 
KAI ABC Series "Genetic Engineering for Producing Materials", published by ASAKURA SHOTEN; and Yamano. S.. 
Ishii. T. Nakagawa. M.. Ikenaga. H.. Misawa. N.. "Metabolic Engineering for Production of p-carotene and lycopene in 
Saccharomyces cerevisiae". Biosci. Biotech. Biochem.. 58, p. 1112-1114. 1994). 

<Zvmomonas mobilis ) 

Foreign genes can be introduced into an ethanol -producing bacterium 7 ymompnas mobilis by the conjugal transf er 
method which is common to Gram-negative bacteria, and the foreign genes can be expressed by using a vector pZA22 
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for Zvmomonas mobilis (see Katsumi Nakamura. "Molecular Breeding of Zvmomonas mobilis ". Nippon Nogei-Kagaku 
Kaishi, 63, p. 1016-1018, 1989; and Misawa, N., Yamano, S., Ikanaga, H., "Production of p-Carotene in Zymomonas 
mobilis and Agrobacterium tumefaciens by Introduction of the Biosynthesis Genes from Erwinia uredovora ". Appl. Envi- 
ron. Microbiol., 57, p.1847-1849, 1991). 



(Aorobacterium tumefaciens ) 

Foreign genes can be introduced into a plant pathogenic bacterium Aarobacterium tumefaciens by the conjugal 
transfer method which is common to Gram-negative bacteria, and the foreign genes can be expressed by using a vector 
10 pBI121 for a bacterium such as Agrobacterium tumefaciens (see Misawa, N., Yamano, S., Ikenaga, H., "Production of 
p-Carotene in Zymomonas mobilis and Agrobacteriu m tumefaciens by Introduction of the Biosynthesis Genes from 
Erwinia uredovora ". Appl. Environ. Microbiol., 57, p. 1847-1849, 1991). 

Production of xan thophvlls bv microorganisms 



The gene cluster for the synthesis of ketocarotenoids such as astaxanthin derived from a marine bacterium can be 
introduced and expressed by the procedure or method described above for introducing and expressing an foreign gene 
in a microorganism. 

Farnesyl pyrophosphate (FPP) is a substrate which is common not only to carotenoids but also to other terpenoids 

20 such as sesquiterpenes, triterpenes, sterols, hopanols and the like. In general, microorganisms synthesize terpenoids 
even if they cannot synthesize carotenoids, so that all of the microorganisms should basically have FPP as an interme- 
diate metabolite. Furthermore, the carotenoid synthesis gene cluster of a non-photosynthetic bacterium Erwinia has an 
ability to synthesize the substrates of the cri gene products of Agrobacterium aurantiacus sp. nov. MK1 or AlcaMggngs 
sp. PC-1 by using FPP as a substrate (see Fig. 10). The present inventors have already confirmed that when the group 

25 of crt genes of Erwinia is introduced into not only Escherichia coti but also the aforementioned microorganisms, that is 
the yeast Saccharomvces cerevisiae. the ethanol producing bacterium Zvmomonas mobilis. or the plant pathogenic 
bacterium Aorobacterium tumefaciens. carotenoids such as p-carotene and the like can be produced, as was expected, 
by these microorganisms (Yamano, S., Ishii, T, Nakagawa, M., Ikenaga. H., Misawa, N„ "Metabolic Engineering for Pro- 
duction of p-Carotene and Lycopene in Saccharomvces cerevisiae ". Biosci. Biotech. Biochem., 58, p. 1 1 12-1 1 1 4, 1 994; 

30 Misawa, N., Yamano, S., Ikenaga, H., "Production of p-Carotene in Zymomonas mobilis and Aorobacterium tumefa- 
ciens bv Introduction of the Biosynthetic Genes from Erwinia uredovora ". Appl. Environ. Microbiol.. 57, p. 1847-1849, 
1991 ; and Japanese Patent Application No. 58786/1991 (Japanese Patent Application No. 53255/1990) by the present 
inventors: "DNA Strands useful for the Synthesis of Carotenoids"). 

Thus, it should be possible in principle to allow all of the microorganisms, in which the gene introduction and 

35 expression system has been established, to produce ketocarotenoids such as astaxanthin and the like by introducing 
the combination of the carotenoid synthesis gene cluster derived from Erwinia and the DNA strands according to the 
present invention (typically the carotenoid synthesis gene cluster derived from Aarobacterium aurantiacus sp. nov. MK1 
or Alcaliaenes sp. PC-1) at the same time into the same microorganism. The process for producing a variety of keto- 
carotenoids in microorganisms are described below 



(Production of canthaxanthin and echinenone) 

It is possible to produce canthaxanthin as a final product and echinenone as an intermediate metabolite by intro- 
ducing into a microorganism such as Escherichia coti and expressing the crtE . crtB. ertl and crtY genes of Erwinia ure- 

45 dovora required for the synthesis of p-carotene and any one of the DNA strands of the present invention (1) - (9) which 
is a keto group-introducing enzyme gene (typically, the crtW gene of Agrobacterium aurantiacus sp. nov. MK1 or AlGfli 
ligenes PC-1). The yields or the ratio of canthaxanthin and echinenone can be changed by controlling the expression 
level of the DNA strand (crtW gene) or examining the culturing conditions of a microorganism having the DNA strand. 
Two embodiments in Escherichia coli are described below, and more details will be illustrated in Examples. 

so A plasmid pACCAFM 6AcrtX that a fragment containing the crtE. crtB. crt! and grtY genes of Erwinia uredovora has 

been inserted into the Escherichia coli vector pACYC184 and a plasmid pAK916 that a fragment containing the crtW 
gene of Agrobacterium aurantiacus sp. nov. MK1 has been inserted into the Escherichia coli vector pBluescript II SK- 
were introduced into Escherichia coli JM101 and cultured to the stationary phase to collect bacterial cells and to extract 
carotenoid pigments. The extracted pigments comprised 94% of canthaxanthin and 6% of echinenone. Also, canthax- 

55 anthin was obtained in.a yield of 3 mg starting from 2 liters of the culture solution. 

A plasmid pACCARI 6AcrtX that a fragment containing the crtE. crtB. ertl and crtY genes of Erwinia uredovora has 
been inserted into the Escherichia coli vector pACYCl84 and a plasmid pPCl7-3 that a fragment containing the crtW 
gene of Alcaligenes PC-1 has been inserted into the Escherichia coli vector pBluescript II SK+ were introduced into 
Escherichia coli JM 1 01 and cultured to the stationary phase to collect bacterial cells and to extract carotenoid pigments. 
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The extracted pigments comprised 40% of canthaxanthin and 50% of echinenone. The remainder comprised 10% of 
unreacted p-carotene. 

(Production of astaxanthin and 4-ketozeaxanthin) 

It is possible to produce astaxanthin as a final product and 4-ketozeaxanthin as an intermediate metabolite by intro- 
ducing into a microorganism such as Escherichia coli or the like and expressing the crtE, crtB, crtl, crtY and crtZ genes 
of Frwinia uredovora required for the synthesis of zeaxanthin and any one of the DNA strands of the present invention . 
(10) - (18) which is a keto group-introducing enzyme gene (typically, the crtW gene of Aqrobacterium aurantiacus sp. 
nov MK1 or Alcaliqenes PC-1). The yields or the ratio of astaxanthin and 4-ketozeoxanthin can be changed by control- 
ling the expression level of the DNA strand (crtWoene) or examining the culturing conditions of a microorganism having 
the DNA strand. 

Two embodiments in Escherichia coli are described below, and more details will be illustrated in Examples. 
A plasmid pACCAR25AcrtX that a fragment containing the erlE, crjB, cm, gUY and £fiZ genes of Erwinia gredpvprq 
has been inserted into the Escherichia coli vector pACYC184 and a plasmid pAK916 that a fragment containing the 
crt w gene of A n aurantiacus sp. nov. MK1 has been inserted into the Escherichia poli vector pBluescript II SK- were 
introduced into Escherichia coli JM101 and cultured to the stationary phase to collect bacterial cells and to extract car- 
otenoid pigments. The yield of the extracted pigments was 1 .7 mg of astaxanthin and 1 .5 mg of 4-ketozeaxanthin based 
on 2 liters of the culture solution. 
20 A plasmid P ACCAR25AcrtX that a fragment containing the crtE, crtB, crtl, crtY and crtZ genes of Erwmia uredovora 
has been inserted into the Escherichia coli vector pACYCl84 and a plasmid pPCl7-3 that a fragment containing the 
crtw gene of Alcalioenes PC-1 has been inserted into the Escherichia coli vector pBluescript II SK+ were introduced 
into Escherichia coli JM101 and cultured to the stationary phase to collect bacterial cells and to extract carotenoid pig- 
ments. The yield of the extracted pigments was about 1 mg of astaxanthin and 4-ketozeaxanthin, respectively based on 
25 2 liters of the culture solution. 

(Production of astaxanthin and phoenicoxanthin) 

It is possible to produce astaxanthin as a final product and phoenicoxanthin as an intermediate metabolite by intro- 
30 ducing into a microorganism such as Escherichia coli or the like and expressing the crtE. crtB, crtl and crfV; genes of 
Erwinia uredovora required for the synthesis of p-carotene, any one of the DNA strands of the present invention (1 ) - (9) 
which is a keto group-introducing enzyme gene (typically, the crtW gene of Aqrobacterium aurantiacus sp. nov. MK1 or 
Alcalioenes PC-1), and any one of the DNA strands of the present invention (19) - (27) which is a hydroxyl group-intro- 
ducing enzyme gene (typically, the crJZ gene of An aurantiacus sp. nov. MK1 or Alcaliqenes PC-1). The yields or the 
35 ratio of astaxanthin and phoenicoxanthin can be changed by controlling the expression level of the DNA strands (cdW 
and crtZ genes) or examining the culturing conditions of a microorganism having the DNA strands. An embodiment in 
Escherichia coli are described below, and more details will be illustrated in Examples. 

A plasmid pACCAR16AcrtX that a fragment containing the crtE, crtB, crtl and crtY genes of Erwinia uredovora has 
been inserted into the Escherichia coli vector P ACYC184 and a plasmid pAK96K that a fragment containing the crtw 
40 and crtZ genes of Aa. aurantiacus sp. nov. MK1 has been inserted into the Escherichia coli vector pBluescript II SK- 
were introduced into Escherichia coli JM101 and cultured to the stationary phase to collect bacterial cells and to extract 
carotenoid pigments. The yield of the extracted pigments comprised was 3 mg of astaxanthin and 2 mg of phoenicox- 
anthin starting from 4 liters of the culture solution. 

45 Deposition of microorganisms 

Microorganisms as the gene sources of the DNA strands of the present invention and Escherichia coli carrying the 
isolated genes (the DNA strands of the present invention) have been deposited to National Institute of Bioscience and 
Human Technology, Agency of Industrial Science and Technology. 



50 



(i) Aqrobacterium aurantiacus so. nov. MK1 
Deposition No: FERM BP-4506 
Entrusted Date: December 20, 1993 

(ii) Escherichia coli JM101 (pAccrt-EIB, pAK92) 
55 Deposition No: FERM BP-4505 

Entrusted Date: December 20, 1993 

(iii) Alcalioense sp. PC-1 
Deposition No: FERM BP-4760 
Entrusted Date: July 27, 1994 
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(iv) Escherichia coli p: pPC1 7 
Deposition No: FERM BP-4761 
Entrusted Date: July 27, 1994 



5 



Examples 



The present invention is further described more specifically with reference to the following examples without restric- 
tion of the invention. In addition, the ordinary experiments of gene manipulation employed herein is based on the stand- 
ard methods (SambrooK J.. Fritsch, E.F., Maniatis, T. "Molecular Cloning - A Laboratory Manual", Cold Spring Harbor 
10 Laboratory Press, 1989), unless otherwise specified. 

Example 1 : Preparation of chromosomal DNA 

Chromosomal DNAs were prepared from three marine bacterial strains, i.e. Agrobacterium aurantiacus sp. nov. 

75 MK1 , Alcaliaenes sp. PC-1 . and Alteromonas SD-402 (Yokoyama, A., Izumida, H., Miki, W., "Marine bacteria produced 
astaxanthin", 1 0th International Symposium on Carotenoids, Abstract. CL1 1 -3, 1 993). After each of these marine bac- 
teria was grown in 200 ml of a culture medium (a culture medium prepared according to the instruction of "Marine Broth" 
manufactured by DIFCO) at 25°C for 4 days to the stationary phase, the bacterial cells were collected, washed with a 
TES buffer (20 mM Tris, 10 mM EDTA, 0.1 M NaCI, pH 8), subjected to heat treatment at 68°C for 15 minutes, and sus- 

20 pended into the solution I (50 mM glucose, 25 mM Tris. 1 0 mM EDTA, pH 8) containing 5 mg/ml of lysozyme (manufac- 
tured by SEIKAGAKU KOGYO) and 100 Mg/ml of RNase A (manufactured by Sigma). After incubation of the suspension 
at 37°C for 1 hour, Proteinase K (manufactured by Boehringer-Mannheim) was added and the mixture was incubated 
at 37°C for 10 minutes. After SARCOSIL (N-lauroylsarcosine Na, manufactured by Sigma) was then added at the f inal 
concentration of 1% and the mixture was sufficiently mixed, it was incubated at 37°C for several hours. The mixture was 

25 extracted several times with phenol/chloroform, and ethanol in a two-time amount was added slowly. Chromosomal 
DNA thus deposited was wound around a glass rod, rinsed with 70% ethanol and dissolved in 2 ml of a TE buffer (10 
mM Tris. 1 mM EDTA. pH 8) to prepare a chromosomal DNA solution. 

Example 2: Preparation of hosts for a cosmid library 



(1) Preparation of phytoene-producing Escherichia coli 

After the removal of the BstEll (1235) - Eco521 (4926) fragment from a plasmid pCAR16 having a carotenoid syn- 
thesis gene cluster except the criZ gene of Erwinia uredovora (Misawa, N.,Nakagawa. M., Kobayashi, K., Yamano. S. t 

35 Izawa. Y, Nakamura, K.. Harashima. K., "Elucidation of the Erwinia uredovora Carotenoid Biosynthetic Pathway by 
Functional Analysis of Gene Porducts expressed in Escherichia coli ". J. Bacterid., 172, p. 6704-6712, 1990; and Jap- 
anese Patent Application No. 58786/1991 (Japanese Patent Application No. 53255/1990): "DNA Strands useful for the 
Synthesis of Carotenoids"), a 2.3 kb Asp718 (Kpnl) - EcoRI fragment containing the SHE and cflB genes required for 
the production of phytoenes was cut out. This fragment was then inserted into the EcqRV site of the E. poli vector 

40 pACYC184 to give an aimed plasmid (pACCRT-EB). The bacterium E. coli containing pACCRT-EB exhibits resistance 
to an antibiotic chloramphenicol (Cm r ) and produces phytoenes (Unden, H., Misawa, N., Chamovitz, D., Pecker, I., Hir- 
schberg, J., Sandmann, G., "Functional Complementation in Escherichia coli of Different Phytoene Desaturase Genes 
and Analysis of Accumulated Carotenes", Z. Naturforsch. , 46c, 1045-1051, 1991). 

45 (2) Preparation of lycopene-producing Escherichia coli 

After the removal of the BslEll (1235) - Sna BI (3497) fragment from a plasmid pCAR16 having a carotenoid syn- 
thesis gene cluster except the crtZ gene of Erwinia uredovora , a 3.75 kb A§p_718 (Kpnl) - EcoRI fragment containing 
the crtE . crtl and crtB genes required for the production of lycopene was cut out. This fragment was then inserted into 
so the E£Q_RV site of the E. coli vector pACYCl84 to give an aimed plasmid (pACCRT-EIB). The bacterium E. coli contain- 
ing pACCRT-EIB exhibits Cm r and produces lycopene (Cunningham Jr, FX., Chamovitz, D., Misawa, N., Gatt, E., Hir- 
schberg. J., "Cloning and Functional Expression in Escherichia coli of Cyanobacterial Gene for Lycopene Cyclase, the 
Enzyme that catalyzes the Biosynthesis of p-Carotenes", FEBS Lett., 328, 130-138. 1993). 

55 (3) Preparation of p-carotene-producing Escherichia coli 

After the otX gene was inactivated by subjecting a plasmid pCAR16 having a carotenoid synthesis gene cluster 
except the q3Z gene of Erwinia uredovora to digestion with restriction enzyme BstEN. the Klenow fragment treatment 
and the ligation reaction, a 6.0 kb Asp 718 (Kpnl) - EcoRI fragment containing crtE, QlY, oil and criB genes required 
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for the production of p-carotene was cut out. This fragment was then inserted into the EcoRV site of the E^seli vector 
pACYCl 84 to give an aimed plasmid (referred to hereinafter as P ACCAR16AcrtX). The bacterium Ecgli containing 
oACCAR16AcrtX exhibits Cm r and produces p-carctene. In this connection, the restriction enzyme and enzymes used 
for genetic manipulation have been purchased from TAKARA SHUZO (K.K.) or Boehringer-Mannheim. 

FYamnle 3: p re paration of a cosmid library and acquisi t ion of Escherichia coli which exhibits orange in color 

After the restriction enzyme Sau3AI was addec | j n an amount of one unit to 25 ng of the chromosomal DNA of Agn> 
hartprinm aurantiacus sp nov. MK1 . the mixture was incubated at 37°C for 15 minutes and heat treated at 68°C for 10 
minutes to inactivate the restriction enzyme. Under the condition, many partially digested fragments with £au3AI were 
obtained at about 40 kb. The cosmid vector pJBB (resistant to ampicillin (ApO) which had been subjected to BamHI 
diaestion and alkaline phosphatase treatment and the right arm (shorter fragment) of pJBB which had been digested 
with Sall/BamHI and then recovered from the gel were mixed with a part of the above §au3AI partial fragments, and 
lioated at 1 2°C overnight. In this connection. pJ BB has been purchased from Amersham. 

Phage particles were obtained in an amount sufficient for preparing a cosmid library by the invjtrg packaging with 
a Gioapack Gold (manufactured by Stratagene; available from Funakoshi) using the DNA above ligated. 

After Escherichia coli DH1 (ATCC33849) and Escherichia coli DH1 . each of which has one of the three plasm.ds 
oreoared in Example 2, were infected with the phage particles, these bacteria were diluted so that 100 - 300 colonies 
were found on a plate, plated on LB containing appropriate antibiotics (1% trypton, 0.5% yeast extract. 1% NaCI), and 
cultured at 37°C or room temperature for a period of overnight to several days. 

As a result in cosmid libraries having the simple Escherichia coli (beige) or the phytoene-producing Escherichia 
coli (beige) with pACCRT-EB as a host, no colonies with changed color were obtained notwithstanding the screening of 
aten thousand or more of the colonies for respective libraries. On the other hand, in cosmid libraries having the lyco- 
pene-producing Escherichia coli (light red) with pACCRT-EIB or the p-carotene-producing Escherichia coli (yellow) with 
pACCARI 6AcrtX as a host, colonies exhibiting orange have appeared in a proportion of one strain to several hundred 
colonies respectively. Most of these transformed Fsrherichia coli strains which exhibits orange contained plasmid pJB8 
in which about 40 kb partially digested Sau3 Al fragments were cloned. It is also understood from the fact that no colo- 
nies with changed color appeared in cosmid libraries having the simple Escherichia coli or the phytoene-producing 
Escherichia coli with pACCRT-EB as a host, that Escherichia coli having an ability of producing a carotenoid synthetic 
intermediate of the later steps of at least phytoene should be used as a host for the purpose of express.on-clon.ng the 
xanthophyll synthesis gene cluster from the chromosomal DNA of Aqrobacterium aurantiacus sp. nov. MK1. 

Example 4: L nr fl ii 7 ation of a fragment co n taining an orange pigment synthesis gene cluster 

When individual several ten colonies out of the orange colonies obtained in cosmid libraries having the lycopene- 
producing Escherichia coli (light red) with pACCRT-EIB or the p-carotene-producing Escherichia col. (yellow) with 
pACCARI 6AcrtX as a host were selected to analyze the plasmids, 33 kb - 47 hb fragments partially d.gested with 
Sau3AI were inserted in vector pJB8 in all of the colonies except one strain. The remaining one strain (lycopene-pro- 
during Fscherichia coli as a host) contains a plasmid. in which a 3.9 kb fragment partially digested with Sau3AI was 
inserted in pJB8 (referred to hereinafter as plasmid pAK9). This was considered to be the one formed by the ULyiyo. 
deletion of the inserted fragment after the infection to Escherichia coli. The same pigment (identified as astaxanth.n .n 
Example 6) as that in the orange colonies obtained from the other cosmid libraries was successfully synthesized with 
the lycopene-producing Escherichia coli having pAK9, pAK9 was used as a material in the following analyses. 

Example 5: nomination of the nucleotide sequen c e in the orange pigment synthesis gene cluster 

A 3 9 kb EcoRI inserted fragment prepared from pAK9 was inserted into the EcqRI site of the Escherichia coli vec- 
tor pBluescrip II SK + to give two plasmids (pAK91 and pAK92) with the opposite directions of the fragment to the vector. 
The restriction enzyme map of one of the plasmids (pAK92) is illustrated in Fig. 12. When P AK92 was introduced into 
the lycopene-producing Escherichia coli . orange colonies were obtained as a result of the synthesis of astaxanth.n 
(Example 6) However, no ability for synthesizing new pigments was afforded even if pAK91 was introduced into the lyc- 
opene-producing Escherichia coli . It was thus considered that the pigment synthesis gene cluster in the plasmid pAK92 
has the same direction as that of the jac promoter of the vector. Next, each of a 2.7 kb Pstl fragment obtained by the 
Pstl digestion of pAK9L a 2.9 kb BamHI fragment obtained by the EamHI digestion of pAK92. and 2.3 kb and 1.6 kb 
Sail fragments obtained by the Sail digestion of pAK92 was cloned into the vector pBluescrip II SK-. The restr.ct.on 
maps of plasmids referred to as pAK94, pAK96. pAK98. pAK910. pAK93, and pAK95 are illustrated in F.g. 12 The plas- 
mids PAK94 pAK96 pAK98 and pAK910 have the pigment synthesis gene cluster in the same direction as that of the 
lac promoter of the vector, while the plasmids pAK93 and pAK95 have the pigment synthesis gene cluster .n the oppo- 
site direction to that of the promoter. 
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It was found that when the plasmid pAK96 having a 2.9 kb Bam HI fragment was introduced into the lycopene-pro- 
ducing Escherichia coli. the transformartt also synthesized astaxanthin as in the case when the plasmid pAK92 having 
a 3.9 kb Eco RI fragment was introduced (Example 6), so that the DNA sequence of the 2.9 kb Bam HI fragment was 
determined. 

5 The DNA sequence was determined by preparing deletion mutants of the 2.9 kb Bam HI fragment from the normal 

and opposite directions and determining the sequence using clones having various lengths of deletions. The deletion 
mutants were prepared from the four plasmids pAK96, pAK98, pAK93 and pAK95 according to the following procedure: 
Each of the plasmids, 10 fig, was decomposed with Sac I and Xba l and extracted with phenol/chloroform to recover 
DNA by ethanol precipitation. Each of DNA was dissolved in 100 \i\ of Exolll buffer (50 mM Tris-HCI, 100 mM NaCI, 5 

w mM MgCI 2 , 10 mM 2-mercaptoethanol, pH 8.0), 180 units of Exolll nuclease was added, and the mixture was main- 
tained at 37°C. A 10 |il portion was sampled at every 1 minute, and two samples were transferred into a tube in which 
20 nl of MB buffer (40 mM sodium acetate, 100 mM NaCI, 2 mM ZnCI 2 , 10% glycerol, pH 4.5) is contained and which 
is placed on ice. After completion of the sampling, five tubes thus obtained were maintained at 65°C for 10 minutes to 
inactivate the enzyme, five units of mung bean nuclease were added, and the mixtures were maintained at 37°C for 30 

75 minutes. After the reaction, five DNA fragments different from each other in the degrees of deletion were recovered for 
each plasmid by agarose gel electrophoresis. The DNA fragments thus recovered was blunt ended with the Klenow 
fragment, subjected to the ligation reaction at 16°C overnight, and Escherichia coli JM109 was transformed. A single 
stranded DNA was prepared from each of various clones thus obtained with a helper phage M13K07, and subjected to 
the sequence reaction with a fluorescent primer cycle-sequence kit available from Applied Biosystem (K.K.), and the 

20 DNA sequence was determined with an automatic sequencer. 

The DNA sequence comprising 2886 base pairs (bp) thus obtained is illustrated in Figs. 5 - 9 (SEQ ID NO: 4). As 
a result of examining an open reading frame having a ribosome binding site in front of the initiation codon, three open 
reading frames which can encode the corresponding proteins (A - B (nucleotide positions 229 - 864 of SEQ ID NO: 4), 
C - D (nucleotide positions 864 - 1349), E - F (nucleotide positions 1349 - 2506) in Figs. 5 - 9) were found at the posi- 

25 tions where the three xanthophyll synthesis genes crtW. crtZ and crtY are expected to be present. For the two open 
reading frames of A - B and E - F, the initiating codon is GTG, and for the remaining open reading frame C - D, it is ATG. 

Example 6: Identification of the orange pigment 

30 The lycopene-producing Escherichia coli JM101 having pAK92 or pAK96 introduced thereinto (Escherichia coli 
(pACCRT-EIB, pAK92 or pAK96) ; exhibiting orange) or the p-carotene-producing Escherichia coli JM1 01 having pAK94 
or pAK96K (Fig. 12) introduced thereinto (Escherichia coli (pACCAR16AcrtX, pAK94 or pAK96K); exhibiting orange) 
was cultured in 4 liters of a 2YT culture medium (1 .6% trypton, 1% yeast extract, 0.5% NaCI) containing 150 ^ig/ml of 
ampicillin (Ap, manufactured by Meiji Seika) and 30 fig/ml of chloramphenicol (Cm, manufactured by Sankyo) at 37°C 

35 for 18 hours. Bacterial cells collected from the culture solution was extracted with 600 ml of acetone, concentrated, 
extracted twice with 400 ml of chloroform/methanol (9/1), and concentrated to dryness. Then, thin layer chromatogra- 
phy (TLC) was conducted by dissolving the residue in a small amount of chloroform/methanol (9/1) and developing on 
a silica gel plate for preparative TLC manufactured by Merck with chloroform/methanol (15/1). The original orange pig- 
ment was separated into three spots at the Rf values of 0.72, 0.82 and 0.91 by TLC. The pigment of the darkest spot at 

40 Rf 0.72 corresponding to 50% of the total amount of orange pigment and the pigment of secondly darker spot at Rf 0.82 
were scratched off from the TLC plate, dissolved in a small amount of chloroform/methanol (9/1) or methanol, and chro- 
matographed on a Sephadex LH-20 column (15 x 300 mm) with an eluent of chloroform/methanol (9/1) or methanol to 
give purified materials in a yield of 3 mg (Rf 0.72) and 2 mg (Rf 0.82), respectively. 

It has been elucidated from the results of the UV-visible, 1 H-NMR and FD-MS (m/e 596) spectra that the pigment 

45 at Rf 0.72 has the same planar structure as that of astaxanthin. When the pigment was dissolved in diethyl ether : 2- 
propanol : ethanol (5 : 5 : 2) to measure the CD spectrum, it was proved to have stereochemical configuration of 3S, 
3*8, and thus identified as astaxanthin; see Fig. 1 1 for the structural formula). Also, the pigment at Rf 0.82 was identified 
as phoenicoxanthin (see Fig. 1 1 for the structural formula) from the results of its UV-visible, 1 H-NMR and FD-MS (m/e 
580) spectra. In addition, the pigment at 0.91 was canthaxanthin (Example 7(2)). 

so 

Example 7: Identification of metabolic intermediates of xanthoohvll 

(1) Identification of 4-ketozeaxanthin 

55 The zeaxarrthin producing Escherichia coli was prepared according to the following procedure. That is to say. the 
plasmid pCAR25 having total carotenoid synthesis gene cluster of Er. uredorora (Misawa, N. t Nakagawa, M., Koba- 
yashi, K., Yamano, S., Izawa, Y, Nakamura, K., Harashima, K., "Elucidation of the Erwinia uredovora Carotenoid Bio- 
synthetic Pathway by Functional Analysis of Gene Products expressed in Escherichia coli ". J. Bacterid., 1 72, p. 6704- 
6712, 1990; and Japanese Patent Application No. 58786/1991 (Japanese Patent Application No. 53255/1990): "DNA 
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Strands useful for the Synthesis of Carotenoids") was digested with restriction enzyme BstEII, and subjected to the Kle- 
now fragment treatment and ligation reation to inactivate the ci£ gene by reading frame shift, and then a 6.5 kb Asp_718 
(Kenl) - EsqRI fragment containing the cdE, ss£L JZlL £dB and ciiZ genes required for producing zeaxanthm was cut 
out" This fragment was then inserted into the EggRV site of the Escherichia coli vector P ACYC184 to give the aimed 
plasmid (referred to hereinafter as pACCAR25AcrtX). 

The zeaxanthin-producing Escherichia coli JM101 having pAK910 or pAK916 (Fig. 12) introduced thereinto 
( Escherichia coli (pACCAR25AcrtX. pAK910 or pAK916); exhibiting orange) was cultured in 2 liters of a 2YT culture 
medium containing 150 M g/ml of Ap and 30 P g/ml of Cm at 37°C for 18 hours. Bacterial cells collected from the curture 
solution was extracted with 300 ml of acetone, concentrated, extracted twice with 200 ml of chloroform/methanol (9/1). 
and concentrated to dryness. Then, thin layer chromatography (TLC) was conducted by dissolving the residue in a 
small amount of chloroform/methanol (9/1) and developing on a silica gel plate for preparative TLC manufactured by 
Merck with chloroform/methanol (1 5/1 ). The original orange pigment was separated into three spots at the Rf values of 
0 54 (46%) 0 72 (53%) and 0.91 (1%) by TLC. The pigment at Rf 0.54 was scratched off from the TLC plate, dissolved 
in a small amount of chloroform/methanol (9/1) or methanol, and chromatographed on a Sephadex LH-20 column (15 
x 300 mm) with an eluent of chloroform/methanol (9/1) or methanol to give a purified material in a yield of 1.5 mg. 

This matenal was identified as 4-ketozeaxanthin (see Fig. 1 1 for the structural formula) since its UV-v.s.ble spec- 
trum FD-MS spectrum (m/e 582) and mobility in silica gel TLC (developed with chloroform/methanol (15/1)) accorded 
perfectly with those of the standard sample of 4-ketozeaxanthin (purified from Aqrobacterium aurantiacus sp. nov. MK1 ; 
Japanese Patent Application No. 70335/1993). In addition, the pigments at Rf 0.72 and 0.91 are astaxanth.n (Example 
6) and canthaxanthin (Example 7 (2)), respectively. 

(2) Identification of canthaxanthin 

The p-carotene producing Fsr.herichia coli JM1 01 having pAK910 or pAK916 introduced thereinto ( Escherichia coli 
(pACCAR16AcrtX P AK910 or P AK916): exhibiting orange) was cultured in 2 liters of a 2YT culture medium containing 
150 ug/ml of Ap and 30 »Q/m\ of Cm at 37°C for 18 hours. Bacterial cells collected from the cutture solution was 
extracted with 300 ml of acetone, concentrated, extracted twice with 200 ml of chloroform/methanol (9/1). and concen- 
trated to dryness Then, thin layer chromatography (TLC) was conducted by dissolving the residue in a small amount of 
chloroform/methanol (9/1) and developing on a silica gel plate for preparative TLC manufactured by Merck with chloro- 
form/methanol (50/1). The pigment of the darkest spot corresponding to 94% of the total amount of orange pigments 
was scratched off from the TLC plate, dissolved in a small amount of chloroform/methanol (9/1) or chloroform/methanol 
(1/1). and chromatographed on a Sephadex LH-20 column (15 x 300 mm) with an eluent of chloroform/methanol (9/1) 
or chloroform/methanol (1/1 ) to give a purified material in a yield of 3 mg. , ,_, KtxtCi 

This material was identified as canthaxanthin (see Fig. 1 1 for the structural formula) since its UV-v.sible. H-NMR 
FD-MS (m/e 564) spectra and mobility in silica gel TLC (Rf 0.53 on developing with chloroform/methanol (50/1)) 
accorded perfectly with those of the standard sample of canthaxanthin (manufactured by BASF). In addition, the pig- 
ment corresponding to 6% of the total orange pigments found in the initial extract was considered echmenone (see Fig. 
11 for the structural formula) on the basis of its UV-visible spectrum, mobility in silica gel TLC (Rf 0.78 on developing 
with chloroform/methanol (50/1)). and mobility in HPLC with NOVA PACK HR 6 M C18 (3.9 x 300 mm; manufactured by 
Waters) (RT 16 minutes on developing at a flow rate of 1 .0 ml/min with acetonitrile/methanol/2-propanol (90/6/4)). 

(3) Identif ication of zeaxanthin 

The p-carotene-producing Fsr.herichia coli JM101 having pAK96NK introduced thereinto ( Escherichia coli 
(pACCAR16AcrtX. pAK96NK); exhibiting yellow) was cultured in 2 liters of a 2YT culture medium containing 1 50 M g/ml 
of Ap and 30 jig/ml of Cm at 37°C for 1 8 hours. Bacterial cells collected from the culture solution was extracted with 300 
ml of acetone concentrated, extracted twice with 200 ml of chloroform/methanol (9/1). and concentrated to dryness. 
Then thin layer chromatography (TLC) was conducted by dissolving the residue in a small amount of chloroform/meth- 
anol (9/1) and developing on a silica gel plate for preparative TLC manufactured by Merck with chloroform/methanol 
(9/1) The pigment of the darkest spot corresponding to 87% of the total amount of yellow pigments was scratched off 
from the TLC plate dissolved in a small amount of chloroform/methanol (9/1) or methanol, and chromatographed on a 
Sephadex LH-20 column (15 x 300 mm) with an eluent of chloroform/methanol (9/1) or methanol to give a purified 
material in a yield of 3 mg. 

It has been elucidated that this material has the same planar structure as that of zeaxanthin since its UV-visible. 
1 H-NMR FD-MS (m/e 568) spectra and mobility in silica gel TLC (Rf 0.59 on developing with chloroform/methanol 
(9/1)) accorded perfectly with those of the standard sample of zeaxanthin (manufactured by BASF). When the pigment 
was dissolved in diethyl ether : 2-propanol : ethanol (5 : 5 : 2) to measure the CD spectrum, it was proved to have a 
stereochemical configuration of 3R. 3'R. and thus identified as zeaxanthin (see Fig. 1 1 for the structural formula). Also, 
the pigment corresponding to 13% of the total yellow pigments found in the initial extract was considered p-cryptoxan- 
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thin (see Fig. 1 1 for the structural formula) on the basis of its UV-visible spectrum, mobility in silica gel TLC (Rf 0.80 on 
developing with chloroform/methanol (9/1)), and mobility in HPLC with NOVA PACK HR 6n C18 (3.9 x 300 mm; manu- 
factured by Waters) (RT 19 minutes on developing at a fbw rate of 1.0 ml/min with aretonitrile/methanol/2-propanol 
(90/6/4)). 



(4) Identification of p-carotene 

The lycopene-producing Escherichia coli JM101 having pAK98 introduced thereinto (Escherichia coli (pACCRT- 
EIB, pAK98); exhibiting yellow) was cultured in 2 liters of a 2YT culture medium containing 150 jig/ml of Ap and 30 ^g/ml 
10 of Cm at 37°C for 1 8 hours. Bacterial cells collected from the culture solution was extracted with 300 ml of acetone, con- 
centrated, and extracted twice with 200 ml of hexane. The hexane layer was concentrated and chromatographed on a 
silica gel column (15 x 300 mm) with an eluent of hexane/ethyl acetate (50/1) to give 3 mg of a purified material. 

The material was identified as p-carotene (see Fig. 1 1 for the structural formula), since all of the data of its UV-vis- 
ible, FD-MS spectrum (m/e 536) and mobility in HPLC with NOVA PACK HR 6ji C18 (3.9 x 300 mm; manufactured by 
15 Waters) (RT 62 minutes on developing at a flow rate of 1.0 ml/min with acetonitrile/methanol/2-propanol (90/6/4)) 
accorded with those of the standard sample of p-carotene (all trans type; manufactured by Sigma). 

Example 8: Identification of xanthoohvll synthesis aene cluster 

20 (1) Identification of a keto group-introducing enzyme gene 

It is apparent from the results of Example 6 that among the 3.9 kb fragment contained in pAK9 (Example 4) or 
pAK92, all of the genes required for the synthesis of astaxanthin from lycopene is contained in the 2.9 kb SannHl fra 9- 
ment at the right side (pAK96, Fig. 1 2). Thus, the 1 .0 kb fragment at the left side is not needed. Unique NsqI and Kpjil 

25 sites are present within the 2.9 kb Bam HI fragment of pAK96. It is found from the results of Example 7 (3) that the 1 .4 
kb fragment (pAK96NK) between the Nqq\ and Kpnl sites has a hydroxyl group-introducing enzyme activity but has no 
keto group-introducing enzyme activity. Canthaxanthin can also be synthesized from p-carotene with the 2.9 kb BamHI 
fragment from which a fragment of the right side from unique Sail site between the Nco l and Kpnl sites had been 
removed (pAK910) or with the 2.9 kb Bam HI fragment from which a fragment of the right side from the Hindi site posi- 

30 tioned at the left side of the Sa[l site had been removed (pAK916). but activity for synthesizing canthaxanthin from p- 
carotene disappeared in the 2.9 kb Bam HI fragment of pAK96 from which a fragment of the right side from the N£Ql site 
left of the Hindi site had been removed. On the other hand, even if a fragment of the left side from unique Bglll site 
which is present leftward within the 0.9 kb Bam HI - Hindi fragment of pAK916 was removed, similar activity to that of 
the aforementioned Bam HI - Hind i fragment (pAK916) was observed. It is thus considered that a gene encoding a keto 

35 group-introducing enzyme having an enzyme adivity for synthesizing canthaxanthin from p-carotene as a substrate is 
present within the 0.74 kb Bglli - Hindi fragment of pAK916, and the aforementioned Ncol site is present within this 
gene. As a result of determining the nucleotide sequence, an open reading frame which corresponds to the gene and 
has a ribosome binding site just in front of the initiation codon was successfully deteded, it was then designated as the 
crtW gene. The nucleotide sequence of the crtW gene and the encoded amino acid sequence are illustrated in Fig. 1 

40 (SEQ ID NO: 1). 

The crtW gene produd (CrtW) of Aarobacterium aurantiacus sp. nov. MK1 has an enzyme activity for converting a 
methylene group at the 4-position of a p-ionone ring into a keto group, and one of the specific examples is an enzyme 
activity for synthesizing canthaxanthin from p-carotene as a substrate by way of echinenone (Example 7 (2); see Fig. 
1 1). Furthermore, the crtW gene produd also has an enzyme activity for converting a methylene group at the 4-position 

45 of a 3-hydroxy-p-ionone ring into a keto group, and one of the specific examples is an enzyme activity for synthesizing 
astaxanthin from zeaxanthin as a substrate by way of 4-ketozeaxanthin (Example 7 (1); see Fig. 11). In addition, 
polypeptides having such enzyme adivities and DNA strands encoding these polypeptides have not hitherto been 
known, and the polypeptides and the DNA strands encoding these polypeptides have no overall homology to any 
polypeptides or DNA strands having been hitherto known. Also, no such informations have hitherto been described that 

so a methylene group of not only a p-ionone ring and a 3-hydroxy-p-ionone ring but also the other compounds is directly 
converted into a keto group with an enzyme. 

(2) Identification of a hydroxyl group-introducing enzyme gene 

55 Unique Sail site is present within the 2.9 kb Bam HI fragment of pAK96. When the 2.9 kb Bam HI fragment is cut into 
two fragments at the Sail site, these two fragments (pAK910 and pAK98) have no hydroxyl group-introdudng activity. 
That is to say, the left fragment (pAK910) has only a keto group-introducing enzyme activity (Example 7 (2)), and the 
right fragment (pAK98) has only a lycopene-cyclizing enzyme adivity (Example 7 (4)). On the other hand, when a 1.4 
kb Ncol - KqqI fragment (pAK96NK) containing the aforementioned Sail site is introduced into a p-carotene-producing 
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EscherichiacoL zeaxanthin is synthesized by way of p-cryptoxanthin (Example 7 (3)). It is thus considered that a gene 
encoding a hydroxyl group-introducing enzyme which has an enzyme activity for synthesizing zeaxarith.n from p-caro- 
tene as a sutetraS is present within the 1 .4 kb Nsgl - Kpul fragment of pAK96NK, and the aforementioned Sail srteis 
present within this gene. As a result of determining the nucleotide sequence, an open reading frame which corresponds 
to^elne and has a ribosome binding site just in front of the initiation codon was successfully detected, rt was then 
referred to as the crtZ gene. The nucleotide sequence of the crtZ gene and the encoded amino acid sequence are illus- 
trated in Fig. 2 (SEQ ID NO: 2). 

The crtZ qene product (CrtZ) of A n rnh fl ^pri.,m aurantiacus sp. nov. MK1 has an enzyme activity for adding a 
hydroxyl g^up to the 3-carbon of a p-ionone ring, and one of the specific examples is an enzyme activrty for synthesiz- 
ing zixa'nthin from p-carotene as a substrate by way of p-cryptoxanthin (Example 7 (3); see Fig. 11) Fu*ermore the 
Jz gene product also has an enzyme activity for adding a hydroxyl group to the 3-carbon of a 4-keto-p-.onone ^r.ng 
anyone of the specific examples is an enzyme activity for synthesizing astaxanthin from canthaxanthm as a substrate 
by way of Phoe^coxanthin (Example 6; see Fig. 11). In addition, polypeptides having the latter enzyme activrty and 
DNA strands encoding these polypeptides have not hitherto been known. Also, the CrtZ of Agrobactenum showed sig- 
nificant homology to the CrtZ of Frwinia uredovora (identity of 57%) at the level of amino acid sequence. 

(3) Identification of a lycopene cyclase gene 

Astaxanthin can be synthesized from p-carotene with the 2.9 kb BamHI fragment from which a fragment of the right 
side from a Kpnl site had been removed (pAK96K) or with the 2.9 kb BamHI fragment from which a fragment right from 
trfe Pst. site^ich is placed further right of the K^l site had been removed (pAK94) (Example 6) but astaxanthin can- 
not ^synthesized from lycopene. On the other hand, when a 1 .6 kb Sail fragment (pAK98). which conta.ns a ngh frag- 
ment frcxn unique Sail site present further left than the a^ 

SuSd into lyc^ene-producing F^hsrichia co.i . p-carotene was synthesized (Examp.e 7 (4)). rt ,s thus considered 
S geTe enciS lyco^ne cydase that has an enzyme activrty for sy 

strate is present within the 1.6 kb SMI fragment of pAK98. and this gene is present over a range of the KjmI srte and the 
Pstl site As a result of determining the nucleotide sequence, an open reading frame which corresponds to the gene 
fS has a ribosome binding srte just in front of the initiation codon was successfully detected, it was then referred to as 
The crfn gene. The nucleotide sequence of the crtY gene and the amino acid sequence to be encoded are illustrated .n 

30 Fi9S The crtY E gene product (CrtY) of aurantiacus sp. nov. MK1 has significant homology to the CrtY of 

Frwinia uTedovora (identity of 44.3%) at the level of amino acid sequence, and the functions of both enzymes are the 
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3 5 F* am P'* Q- Southern hlottino analysis with thA r.hromosomal DNA of the other marine bacteria 

Examination was conducted whether a region exhibiting homology with the isolated zM [ a nd is obtainec I from 
a chromosomal DNAs of the other marine microorganisms. The chromosomal DNAs ol AlcaJsg^sa sp. PC-land 
Arteromonas sp. SD-402 prepared in Example 1 were digested with restriction enzymes BamHI and Pst and sepa- 
m X agar ote gel electrophoresis. All of the DNA fragments thus separated were denatured wito an a.ka , ^solution 
of 0 5 N NaOH and 1.5 M NaCI. and transferred on a nylon membrane filter over an overn.ght period. The ny ton mem- 
brane filter on which DNAs had been adsoibed was dipped in a hybridization solution ^ * Denhardt ^SSOl ~ W*^ 
ssDNA) and pre-hybridization was conducted at 60°C for 2 hours. Next, the 1.5 kb DNA fragment cut outfrom pAK96K 
with fisll. whion contains ertW and cd* was labelled with a Mega prime™ DNA labelling system. (Amersham) and [o- 
32 P]dCTP (- 1 1 0TBq/mmol) and added to the aforementioned prehybr.dizat.on solution to conduct hybridization at 60 C 

,0f '^hybridization, the filter was washed with 2 * SSC containing 0.1% SDS at 60°C for 1 hour, and subjected to 
the detection of signals showing homology by autoradiography. As a result, strong signals were obte ^.^bout 13 kb 
in the product digested with BamHI and at 2.35 kb in the product digested with Pstl in the case of Alcal.qenes spPC- 
50 1 and strong signals were obtained at about 5.6 kb in the product digested with BamHI and at 20 kb or more .n the prod- 
uct digested with P§ll in the case of Alteromonas sp. SD-4. 

Fvam piP m Arauisitinn of a xanth n phvll synthes is n*" ft clustprfrom the other marine bacterium 

As it was found from the results of Example 9 that the Esll digest of the chromosomal DNA of Al^iqenes , sp PC- 
1 has a region of about 2.35 kb hybridizing with a DNA fragment containing the erM and edZ genes of Agrobactenum 
auranlL?s sp.nov.MK1. the chromosomal DNA of Alcaliqenes was digested with Pstl. and then DNA fragment ■ « 2 
3.5 kb in size was recovered by agarose gel electrophoresis. The DNA fragments thus collected were insert« nto the 
pill site of a vector pBluescript II SK + . and introduced into Escherichia «?» DH5a to prepare a partial hbrary of A&aJb 
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genes . When the partial library was subjected to colony hybridization with a 1 .5 kb DNA fragment containing the crtW 
and ££iZ genes of Agrobacterium as a probe, a positive colony was isolated from about 5,000 colonies. In this case, 
colony hybridization was conducted under the same condition as in the Southern blotting analysis shown in Example 9. 
When plasmid DNA was isolated from the colony thus obtained, and digested with P§tl to examine the size of the inte- 

5 grated DNA fragments, it was found that the plasmid contained three different fragments. Thus, a 2.35 kb fragment to 
be hybridized was selected from the three different DNA fragments by the Southern blotting analysis described in 
Example 9, the 2.35 kb Pstl fragment was recovered by agarose gel electrophoresis and inserted again into the Pstl 
site of pBluescript II SK+ to prepare the plasmids pPC1 1 and pPCl2. In pPCl 1 and pPC12, the aforementioned 2.35 
kb Pstl fragment was inserted into the Pstl site of pBluescript II SK+ in an opposite direction to each other. The restric- 

70 tion enzyme map of pPCl 1 is illustrated in Fig. 19. 

Exam ple ii: Determination of nucleotide sequence of xanthoohvH synthesis aene duster in Alcaliqenes 

When each of pPC 1 1 and pPC1 2 was introduced into p-carotene-producing Escherichia gpli, orange colonies were 
is obtained due to the synthesis of astaxanthin (Example 12) in the former, but no other pigments were newly synthesized 
in the latter. It was thus considered that the direction of the astaxanthin synthesis gene cluster in the plasmid pPC11 
was the same as that of the vector lac promoter. It was also found that pPC1 1 contained no lycopene cyclizing enzyme 
genes, since no other pigments were newly produced even if pPCll was introduced into the lycopene-producing 
Escherichia coli . 

20 It was found that even if a plasmid having a 0.72 kb BstEII - EcoRV fragment positioned at the right side of the Pstl 

fragment had been removed (referred to as pPCl7, Fig. 19) was introduced into the p-carotene-producing Escheriphfr 
coli. the transformant of Escherichia coli synthesized astaxanthin and the like (Example 12), same as in the case of ^ 
coli into which pPCl 1 was introduced, so that the nucleotide sequence of the 1 .63 kb Estl - BslEII fragment in pPCl7 
was determined. 

25 Deletion mutants were prepared with pPCl 7 and pPCl2 according to the following procedure. A 10 ng portion of 

each of pPC17 and pPCl2 was digested with Kenl and Hindlll or Konl and EcqRI. extracted with phenol/chloroform, 
and DNA was recovered by precipitation with ethanol. Each of DNAs was dissolved in 100 ul of ExqIII buffer (50 mM 
Tris-HCI, 100 mM NaCI. 5 mM MgCI 2 . 10 mM 2-mercaptoethanol, pH 8.0). 180 units of ExqIII nuclease was added, and 
the mixture was maintained at 37°C. A 1 0 ^il portion was sampled at every 1 minute, and two samples were transferred 

30 into a tube in which 20 \i\ of an MB buffer (40 mM sodium acetate, 100 mM NaCI, 2 mM ZnCI 2 , 10% glycerol. pH 4.5) 
is contained and which is placed on ice. After completion of the sampling, five tubes thus obtained were maintained at 
65°C for 10 minutes to inactivate the enzyme, five units of mung bean nuclease were added, and the mixture was main- 
tained at 37°C for 30 minutes. After the reaction, ten DNA fragments different from each other in the degrees of deletion 
were recovered for each plasmid by agarose gel electrophoresis. The DNA fragments thus recovered were blunt ended 

35 with the Klenow fragment, subjected to the ligation reaction at 16°C overnight, and Escherichia CQli JM109 was trans- 
formed. A single stranded DNA was prepared from each of various clones thus obtained with a helper phage M13K07, 
and subjected to the sequence reaction with a fluorescent primer cycle-sequence kit available from Applied Biosystem 
(K.K.), and the DNA sequence was determined with an automatic sequencer. 

The DNA sequence comprising 1631 base pairs (bp) thus obtained is illustrated in Figs. 16-18 (SEQ ID NO: 7). 

40 As a result of examining an open reading frame having a ribosome binding site in front of the initiating codon, two open 
reading frames which can encode the corresponding proteins (A - B (nucleotide positions 99 - 824 of SEQ ID NO: 7), 
C - D (nucleotide positions 824 - 1 309) in Figs. 16-18 were found at the positions where the two xanthophytl synthesis 
genes crtW and crtZ were expected to be present. 

45 Example 12: Identif ication of pigments produced bv Escherichia coli having an Alcaliqenes xanthophyll synthesis gene 
cluster 

(1) Identification of astaxanthin and 4-ketozeaxanthin 

so A deletion plasmid (having only crtW) having a deletion from the right BstEII to the nucleotide position 1 162 (Fig. 
1 7) (nucleotide position 1 1 62 of SEQ ID NO: 7) among the deletion plasmids from pPC1 7 prepared in Example 1 1 was 
referred to as pPC17-3 (Fig. 19). 

The zeaxanthin-producing Escherichia coli JM101 (Example 7 (1)) having pPC17-3 introduced thereinto 
( Escherichia coli (pACCAR25AcrtX. pPC17-3): exhibiting orange) was cultured in 2 liters of 2YT culture medium con- 

55 taining 150 ng/ml of Ap and 30 pg/ml of Cm at 37°C for 18 hours. Bacterial cells collected from the culture solution was 
extracted with 300 ml of acetone, concentrated, extracted twice with 200 ml of chloroforrrVmethanol (9/1), and concen- 
trated to dryness. Then, thin layer chromatography (TLC) was conducted by dissolving the residue in a small amount of 
chloroform/methanoJ (9/1) and developing on a silica gel plate for preparative TLC manufactured by Merck with chloro- 
forrrVmethanol (15/1). The original orange pigment was separated into three spots at the Rf values of 0.54 (ca. 25%), 
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0 72 (ca 30%) and 0 91 (ca. 25%). The pigments at the Rf values of 0.54 and 0.72 were scratched off from the TLC 
elate dissolved in a small amount of chloroform/methanol (9/1) or methanol, and chromatographed on a Sephadex LH- 
20 column (15 x 300 mm) with an eluent of chloroform/methanol (9/1) or methanol to give purified materials in a yield 
of about 1 mg, respectively. 

The materials were identified as 4-ketozeaxanthin (Rf 0.54) and astaxanthin (Rf 0.72), since all of the data of their 
UV-visible FD-MS spectra and mobility in TLC (developed with chloroform/methanol (1 5/1 )) accorded with those of the 
standard samples of 4-ketozeaxanthin and astaxanthin. In addition, the pigment at the Rf value of 0.91 was canthaxan- 

th ' n ifwaTalo 1 cSirmed by the similar analytical procedures that the p-carotene-producing Escherichia cpli JM101 
having pPC1 1 or pPC1 7 introduced thereinto (Escherichia coli (pACCAR16AcrtX. pPC1 1 or pPC1 7) (exhibiting orange) 
produces astaxanthin, 4-ketozeaxanthin and canthaxanthin. Furthermore, it was also confirmed with the authentic sam- 
ple ol phoen.coxanthin obtained in Example 6 that these E. coli transformants produce a trace amount of phoemcoxan- 
thm. 

is (2) Identification of canthaxanthin 

The p-carotene-producing Fsr.herichia coli JM101 having P PCl7-3 introduced thereinto ( Escherichia coli 
(DACCARlSAcrtX P PCl7-3); exhibiting orange) was cultured in 2 liters of 2YT culture medium containing 150 ng/ml of 
Ad and 30 uq/ml of Cm at 37°C for 18 hours. Bacterial cells collected from the culture solution was extracted with 300 

20 Z of acetone, concentrated, extracted twice with 200 ml of chloroform/methanol (9/1 ). and concentrated to dryness. 
Then thin layer chromatography (TLC) was conducted by dissolving the residue in a small amount of chloroform/meth- 
anol (9/D and developing on a silica gel plate for preparative TLC manufactured by Merck with chloroform/methanol 
(50/1) The darkest pigment corresponding to 40% of the total amount of orange pigments was scratched off from the 
TLC Plate dissolved in a small amount of chloroform/methanol (9/1) or chloroform/methanol (1/1). and chromato- 

25 graphed on a Sephadex LH-20 column (15 x 300 mm) with an eluent of chloroform/methanol (9/1) or chloroform/meth- 
anol (1/1) to give a purilied material in a yield of 2 mg. 

The material was identified as canthaxanthin. since all of the data of its UV-visible. FD-MS (m/e 564) spectra and 
mobility in TLC (developed with chloroform/methanol (50/1)) accorded with those of the standard sample of canthaxan- 
thin (manufactured by BASF). In addition, the pigment of which amount corresponds to 50% of the ^mount of he 

so orange pigments observed in the initial extract was considered to be echmenone from its UV-v.s.ble spe^um mobHity 
Si gel TLC (developed with chloroform/methanol (50/1)), and mobility in HPLC with NOVA PACK HR 6u CIS ; CM 
x 300 mm; manufactured by Waters) (developed with acetonitrile/methanol/2-propanol (90/6/4)) (Example 7 (2)). In 
addition, the balance of the extracted pigments, 10%. was unreacted p-carotene. 

35 (3) Identification of zeaxanthin 

A plasmid having a 1 . 1 5 kb Sail fragment within pPC1 1 inserted in the same direction as the plasmid pPC1 1 into 
the Sail site of pBluescript II SK+ was prepared (referred to as pPC1 3. see Fig. 19). 

"The p-carotene-producing F^hPrichia coli JM101 having P PC13 introduced thereinto ( Escherichia col. 

40 (pACCARI 6AcrtX. P PC13); exhibiting yellow) was cultured in 2 liters of 2YT culture medium containing 1 50 ng/ml of Ap 
and 30 uo/ml of Cm at 37°C for 18 hours. Bacterial cells collected from the culture solution was extracted with 300 ml 
of acetone concentrated, extracted twice with 200 ml of chloroform/methanol (9/1 ). and concentrated to dryness. Then 
thin layer chromatography (TLC) was conducted by dissolving the residue in a small amount of chlorofonWmethanol 
(9/1) and developing on a silica gel plate for preparative TLC manufactured by Merck wrth chloroforrn/methanol (9/1). 

45 The darkest pigment corresponding to 90% of the total amount of orange pigments was scratched off from the TLC 
olate dissolved in a small amount of chloroform/methanol (9/1 ) or methanol, and chromatographed on a Sephadex LH- 
20 column (15 x 300 mm) with an eluent of chloroform/methanol (9/1) or methanol to give a purified material in a yield 

0,3 Th 3 e material was identified as zeaxanthin. since all of the data of its UV-visible. FD-MS (m/e 568) spectra and 
<o mobility in TLC (developed with chloroform/methanol (9/1 )) accorded with those of the standard sample of zeaxanthin 
(Example 7 (3)) In addition the pigment of which amount corresponds to 10% of the total amount of the orange pig- 
ments observed in the initial extract was considered to be p-cryptoxanthin from its UV-visible spectrum, mobility ins.lica 
ge^ TLC (developed with chloroform/methanol (9/1)), and mobility in HPLC with NOVA PACK HR 6,. C18 (3.9 x 300 mm; 
manufactured by Waters) (developed with acetonitrile/methanol/2-propanol (90/6/4)) ( Example 7 (3)). 

£5 
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Example 13: Identification of the Alcaliaenes xanthophvtl synthes is oene cluster 



(1) Identification of a keto group-introducing enzyme gene 



5 It is apparent from the results of Examples 1 1 and 12 (1) that all of the genes required for the synthesis of astax- 

anthin from p-carotene among the 2.35 kb Pstl fragment contained in pPC1 1 is contained in the 1.63 kb P§tl - BstEII 
fragment (pPC1 7, Fig. 19) in the left side. Thus, the 0.72 kb BstEII • Pstl fragment in the right side is not needed. Unique 
Smal and SaM sites are present within the 1 .63 kb PsJI - BstEII fragment of pPCl 7 (Fig. 19). It is confirmed by the pig- 
ment analysis with a (3-carotene-producing Escherichia coli having the deletion plasmids introduced thereinto that the 

10 keto group-introducing enzyme activity was lost when the 0.65 kb and 0.69 kb fragments at the left side from Smal and 
Sai l sites were removed. It was also confirmed by the pigment analysis with a p-carotene-producing Escherichia con 
having the plasmid introduced thereinto that the plasmid having a 0.69 kb P§tl - Sail fragment positioned at the left side 
of the 1 .63 kb Pstl - BstEII fragment inserted into the Pstl - Sail site of pBluescript SK+ has no keto group-introducing 
enzyme activity. On the other hand, the deletion plasmid pPCl7-3 (Fig. 19) in which deletion from the BstEII end at the 

75 right end to the nucleotide No. 1162 (nucleotide position 1 162 in SEQ ID NO: 7) occurred has a keto group-introducing 
enzyme activity (Example 12 (1), (2)), so that it is considered a gene encoding a keto group-introducing enzyme having 
an enzyme activity for synthesizing canthaxanthin or astaxanthin with a substrate of p-carotene or zeaxanthin is present 
in the 1 162 bp fragment in pPCl7-3, and the aforementioned Sma l and Sail sites are present within this gene. As a 
result of determining the nucleotide sequence, an open reading frame which corresponds to the gene and has a ribos- 

20 ome binding site just in front of the initiation codon was successfully detected, so that it was referred to as the crtW 
gene. The nucleotide sequence of the crtW gene and the encoded amino acid sequence are illustrated in Figs. 13-14 
(SEQ ID NO: 5). 

The crtW gene product (CrtW) of Alcaligenes sp. PC-1 has an enzyme activity for converting a methylene group at 
the 4-position of a p-ionone ring into a keto group, and one of the specific examples is an enzyme activity for synthesiz- 
es ing canthaxanthin from p-carotene as a substrate by way of echinenone (Example 12 (2); see Fig. 11). Furthermore, 
the crtW gene product also has an enzyme activity for converting a methylene group at the 4-position of a 3-hydroxy-p- 
ionone ring into a keto group, and one of the specific examples is an enzyme activity for synthesizing astaxanthin from 
zeaxanthin as a substrate by way of 4-ketozeaxanthin (Example 12 (1); see Fig. 11). In addition, polypeptides having 
such enzyme activities and DNA strands encoding these polypeptides have not hitherto been known, and the polypep- 
30 tides and the DNA strands encoding these polypeptides have no total homology to any polypeptides or DNA strands 
having been hitherto known. Also, the crtW gene products (CrtW) of Aorobacte rium aurantiacus so. nov. MK1 and Alca: 
liaenes sp. PC-1 share high homology (identity of 83%) at the level of amino acid sequence, and the functions of both 
enzymes are the same. The amino acid sequence in the region of 17% having no identity among these amino acid 
sequences is considered not so significant to the functions of the enzyme. It is thus considered particularly in this region 
35 that a little amount of substitution by the other amino acids, deletion, or addition of the other amino acids will not afftect 
the enzyme activity. 

It can be said the keto group-introducing enzyme gene crtW of marine bacteria encodes the p-ionone or 3-hydroxy- 
p-ionone ring ketolase which converts directly the methylene group at the 4-position into a keto group irrelative to 
whether a hydroxyl group is added to the 3-position or not. In addition, no such informations have hitherto been 
40 described that a methylene group of not only a p-ionone ring and a 3-hydroxy- p-ionone ring but also the other com- 
pounds is directly converted into a keto group with one enzyme. 

(2) Identification of a hydroxyl group-introducing enzyme gene 

45 All of the genes rerquired for the synthesis of astaxanthin from p-carotene is contained in the 1 .63 kb P§t' * BstEII 

fragment (Fig. 19) of pPC17. One Sail site is present within the 1.63 kb Pstl - BslEII fragment of pPCl7. It is apparent 
from the results of Example 1 2 (3) that a hydroxyl group-introducing enzyme activity is present in a fragment at the right 
side from the Sail site. It is thus understood that the hydroxyl group-introducing enzyme activity is present in the 0.94 
kb Sail - BstEII fragment which is the right fragment in the 1 .63 kb Pstl - BstEII fragment. As a result of determining the 

so nucleotide sequence, an open reading frame which corresponds to the gene and has a ribosome binding site just in 
front of the initiation codon was successfully detected, it was referred to as the crtZ gene. The nucleotide sequence of 
the crtZ gene and the encoded amino acid sequence are illustrated in Fig. 15 (SEQ ID NO: 6). 

The crtZ gene product (CrtZ) of Alcalioenes sp. PC-1 has an enzyme activity for adding a hydroxyl group to the 3- 
carbon of a p-ionone ring, and one of the specific examples is an enzyme activity for synthesizing zeaxanthin from p- 

55 carotene as a substrate by way of p-cryptoxarrthin (Example 12 (3): see Fig. 1 1). Furthermore, the crJZ gene product 
also has an enzyme activity for adding a hydroxyl group to the 3-carbon of a 4-keto-p-ionone ring, and one of the spe- 
cific examples is an enzyme activity for synthesizing astaxanthin from canthaxanthin as a substrate by way of phoeni- 
coxanthin (Example 12 (1); see Fig. 11). In addition, polypeptides having the latter enzyme activity and DNA strands 
encoding these polypeptides have not hitherto been known. Also, the CrtZ of Alcaligenes sp. PC-1 showed significant 
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homology to the CrtZ of Erwinia uredovora (identity of 58%) at the level of amino acid sequence. In addition, the crtZ 
gene products (CrtZ) of Aarobacterium aurantiacus sp. nov. MK1 and Alcaligenes sp. PC-1 have high homology (iden- 
tity of 90%) at the level of amino acid sequence, and the functions of both enzymes are the same. The amino acid 
sequence in the region of 10% having no identity among these amino acid sequences is considered not so significant 
5 to the functions of the enzyme. It is thus considered particularly in this region that a little amount of substitution by the 
other amino acids, deletion, or addition of the other amino acids will not afftect the enzyme activity. 

(3) Consideration on minor biosynthetic pathways of xanthophylls 

w It has been elucidated by our studies with carotenoid synthesis genes of the epiphytic bacterium Erwjnia or the pho- 
tosynthetic bacterium Rhodobacter that carotenoid biosynthesis enzymes generally act by recognizing the half of a car- 
otenoid molecule as a substrate. By way of example, the lycopene cyclase gene of Erwinia, crtY, recognizes the halves 
of the lycopene molecule to cyclize it. When the phytoene desaturase gene crti of Rhodobacter was used for the syn- 
thesis of neurosporene in place of lycopene in Escherichia coli and erJY of Erwinia was allowed to work on it, the crJY 

is gene product recognizes the half molecular structure common to lycopene to produce a half cyclized p-zeacarotene 
(Linden, H., Misawa, N., Chamovits, D., Pecher, I., Hirschberg, J.. Sandmann, G., "Functional Complementation in 
Escherichia coli of Different Phytoene Desaturase Genes and Analysis of Accumulated Carotenes", Z. Naturforsch., 
46c, p. 1045-1051 , 1991). Also, in the present invention, when CrtW is allowed to work on p-carotene or zeaxanthin, 
echinenone or 4-ketozeaxanthin in which one keto group has been introduced is first synthesized, and when CrtZ is 

20 allowed to work on p-carotene or canthaxanthin, p-cryptoxanthin or phoenicoxanthin in which one hydroxyl group has 
been introduced is first synthesized. It can be considered because these enzymes recognize the half molecule of the 
substrate. Thus, while Escherichia coli having the £rtE, criB, Glii and £dY genes of Erwinia and the edZ gene of a 
marine bacterium produces zeaxanthin as described above, p-cryptoxanthin which is p-carotene having one hydroxyl 
group introduced thereinto can be detected as an intermediate metabolite. It can be thus considered that if CrtW is 

25 present, 3'-hydroxyechinenone or 3-hydroxy echinenone can be synthesized from p-cryptoxanthin as a substrate, and 
that phoenicoxanthin can be further synthesized by the action of CrtW on these intermediates. The present inventors 
have not identified these ketocarotenoids in the culture solutions; and the reason is considered to be that only a trace 
amount of these compounds is present under the conditions carried out in the present experiments. In fact, it was 
described that 3-hydroxyechinenone or 3'-hydroxyechinenone was detected as a minor intermediate metabolite of 

30 astaxanthin in a marine bacterium Aarobacterium aurantiacus sp. nov. MK1 as a gene source (Akihiro Yokoyama ed., 
"For the biosynthesis of astaxanthin in marine bacteria", Nippon Suisan Gakkai, Spring Symposium, 1994, Abstract, p. 
252, 1994). It can be considered from the above descriptions that minor metabolic pathways shown in Fig. 20 are also 
present in addition to the main metabolic pathways of astaxanthin shown in Fig. 11. 

35 Industrial Applicability 

According to the present invention, the gene clusters required for the biosynthesis of keto group-containing xantho- 
phylls such as astaxanthin, phoenicoxanthin, 4-ketozeaxanthin, canthaxanthin and echinenone have successfully been 
obtained from marine bacteria, and their structures, nucleotide sequences, and functions have been elucidated. The 
40 DNA strands according to the present invention are useful as genes capable of affording the ability of biosynthesis of 
keto group-containing xanthophylls such as astaxanthin to microorganisms such as Escherichia coli and the like. 
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SEQUENCE LISTING 

SEQ ID NO: 1 

SEQUENCE LENGTH: 639 

SEQUENCE TYPE: STRANDEDNESS : double 
TOPOLOGY: linear 

MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: Agrobacterium aurantiacus 

STRAIN: sp. nov. MK1 

SEQUENCE 

GTG CAT GCG CTG TGG TTT CTG GAC GCA GCG GCG CAT CCC ATC CTG GCG 48 

Met His Ala Leu Tip Ph e Leu Asp Ala Ala Ala His Pro He Leu Ala 

15 10 15 

ATC GCA AAT TTC CTG GGG CTG ACC TGG CTG TCG GTC GGA TTG TTC ATC S6 

He Ala Asn Ph e Leu Gly Leu Thr Trp Leu Ser Yal Gly Leu Phe He 

20 25 3.0 

ATC GCG CAT GAC GCG ATG CAC GGG TCG GTG GTG CCG GGG CGT CCG CGC 144 

He Ala His Asp Ala Met His Gly Ser Yat Yal Pro Gly Arg Pro Arg 

35 40 45 

GCC AAT GCG GCG ATG GGC CAG CTT GTC CTG TGG CTG TAT GCC GGA TTT 192 

Ala Asn Ala Ala Met Gly Gin Leu Yal Leu Trp Leu Tyr Ala Gly Phe 

50 55 60 
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TCC TGG CGC AAG ATG ATC GTC AAG CAC ATG GCC CAT CAC CGC CAT GCC 240 

Set Ttp Arg Lys Met He Ya I Lys His Met Ah His His Ar g His Ala 

65 70 75 80 

GGA ACC GAC GAC GAC CCC GAT TTC GAC CAT GGC GGC CCG GTC CGC TGG 288 

Gly Thr Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Yal Arg Trp 

85 90 95 

' 5 TAC GCC CGC TTC ATC GGC ACC TAT TTC GGC TGG CGC GAG GGG CTG CTG 336 

Tyr Ala Arg Phe He Gly Thr Tyr Phe Gly Trp Arg Gin Gly Leu Leu 

1 00 1 05 HO 

CTG CCC GTC ATC GTG ACG GTC TAT GCG CTG ATC CTT GGG GAT CGC TGG 384 

Leu Pro Yal lie Yal Tbr Yal Tyr Ala Leu lie Leu Gly Asp Arg Trp 

ll 5 120 125 

ATG TAC GTG GTC TTC TGG CCG CTG CCG TCG ATC CTG GCG TCG ATC CAG 432 

Met Tyr Yal Yal Phe Trp Pro Leu Pro Ser lie Leu Ala Ser He Gin 

130 135 140 

35 CTG TTC GTG TTC GGC ACC TGG CTG CCG CAC CGC CCC GGC CAC GAC GCG 480 

Leu Phe Yal Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala 

145 150 155 160 

TTC CCG GAC CGC CAC AAT CCG CGG TCG TCG CGG ATC AGC GAC CCC GTG 528 

Phe Pro Asp Arg His Asn Ala Arg Ser Ser Arg He Ser Asp Pro Yal 

165 170 175 

TCG CTG CTG ACC TGC TTT CAC TTT GGC GGT TAT CAT CAC GAA CAC CAC 576 

50 Ser Leu Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His 

180 185 190 

£5 
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CTG CAC CCG ACG GTG CCG TGG TGG CGC CTG CCC AGC ACC CGC ACC AAG 624 
Leu His Pro Thr Yal Pro Trp Trp Arg Leu Pro Sc r Thr Arg Thr Lys 

195 200 205 

GGG GAC ACC GCA TGA 639 
Gly Asp Thr Ala *** 
210 
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SEQ ID NO: 2 

SEQUENCE LENGTH: 489 

SEQUENCE TYPE: STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: Agrobacterium aurantiacus 

STRAIN: sp . nov. MK1 

SEQUENCE 



ATG ACC AAt'tTC CTG ATC GTC GTC GCC ACC GTG CTG GTG ATG GAG TTG 48 

Met Thr Asn Phe Leu lie Yal Yal Ala Thr Yal Leu Yal Mel Glu Leu 

l 5 10 15 

ACG GCC TAT TCC GTC CAC CGC TGG ATC ATG CAC GGC CCC CTG GGC TGG . 96 

Thr Ala Tyr Ser Yal His Arg Trp He Met His Gly Pro Leu Gly Trp 

20 25 30 

35 GGC TGG CAC AAG TCC CAC CAC GAG GAA CAC GAC CAC GCG CTG GAA AAG 144 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 

40 35 40 4 5 

AAC GAC CTG TAC GGC CTG GTC TTT GCG GTG ATC GCC ACG GTG CTG TTC 192 

45 Asn Asp Leu Tyr Gly Leu Yal Phe Ala Yal lie Ala Thr Yal Leu Phe 
50 55 60 
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ACG GTG GGC TGG ATC TGG GCG CCG GTC CTG TGG TGG ATC GCC TTG GGC 240 

Thr Yal Glj Trp lie Trp Ala Pro Yal Leu Trp Trp lie Ala Leu Gly 

65 70 75 80 

ATG ACT GTC TAT GGG CTG ATC TAT TTC GTC CTG CAT GAC GGG CTG GTG 288 

Met Tbr Yal Tyr Gly Leu lie Tyr Phe Yal Leu His Asp Gly Leu Yal 

85 90 95 

CAT CAG CGC TGG CCG TTC CGT TAT ATC CCG CGC AAG GGC TAT GCC AGA 336 

His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr Ala Arg 

100 105 110 

CGC CTG TAT CAG GCC CAC CGC CTG CAC CAT GCG GTC GAG GGG CGC GAC 384 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Yal Glu Gly Arg Asp 

115 120 125 

CAT TGC GTC AGC TTC GGC TTC ATC TAT GCG CCC CCG GTC GAC AAG CTG 432 

His Cys Yal Ser Phe Gly Phe lie Tyr Ala Pro Pro Va I Asp Lys Leu 

130 135 140 

AAG CAG GAC CTG AAG ATG TCG GGC GTG CTG CGG GCC GAG GCG CAG GAG 480 

Lys Gin Asp Leu Lys Met Ser Gly Yal Leu Arg Ala Glu Ala Gin Glu 

145 150 155 160 

CGC ACG TGA 489 
Arg Thr — 

50 *** 
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SEQ ID NO: 3 

SEQUENCE LENGTH: 1161 

SEQUENCE TYPE : STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: Agrobacterium aurantiacus 

STRAIN: sp. nov. MK1 



SEQUENCE 

20 GTG ACC CAT GAC GTG CTG CTG GCA GGG GCG GGC CTT GCC AAC GGG CTG 48 

Met Thr His Asp Y2I Leu Leu Ala Gly Ala Gly Leu Ala Asn Gly Leo 

1 5 10 15 

ATC GCC CTG GCG CTG CGC GCG GCG CGG CCC GAC CTG CGC GTG CTG CTG 96 

He Ala Leu Ala Leu A r g Ala Ala Ar g Pro Asp Leu A r g Yal Leu Leu 

20 25 30 

CTG GAC CAT GCC GCA GGA CCG TCA GAC GGC CAC ACC TGG TCC TGC CAC 144 

Leu Asp His Ala Ala Gly Pro Ser Asp Gly His Thr Trp Ser Cys His 

35 40 45 

GAC CCC GAC CTG TCG CCG GAC TGG CTG GCG CGG CTG AAG CCC CTG CGC 192 

Asp Pro Asp Leu Ser Pro Asp Trp Leu Ala A r g Leu Lys Pro Leu Arg 

50 55 60 
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CGC GCC AAC TCG CCC GAC CAG GAG GTG CGC TTT CCC CGC CAT GCC CGG 240 

Arg Ah Asn Trp Pro Asp Gin Glu Va 1 Arg Phe Pro Arg His Ala Arg 

65 70 75 80 

CGG CTG GCC ACC GGT TAC GGG TCG CTG GAC GGG GCG GCG CTG GCG GAT 288 

Arg Leu Ala Thr Gly Tyr Gly Ser Leu Asp Gly Ala Ala Leu Ala Asp 
85 90 95 

' 5 • GCG GTG GTC CGG TCG GGC GCC GAG ATC CGC TGG GAC AGC GAC ATC GCC 336 

Ala Va I Val Arg Ser Gly Ala Glu He Arg Trp Asp Ser Asp He Ala 

100 105 110 

CTG CTG GAT GCG CAG GGG GCG ACG CTG TCC TGC GGC ACC CGG ATC GAG 384 

Leu Leu Asp Ala Gin Gly Ala Thr Leu Ser Cys Gly Thr Arg lie Glu 

115 120 125- 

GCG GGC GCG GTC CTG GAC GGG CGG GGC GCG CAG CCG TCG CGG CAT CTG 432 

Ala Gly Ala Va 1 Leu Asp Gly Arg Gly Ala Gin Pro Ser Arg His Leu 

130 135 140 

35 acc GTG GGT TTC CAG AAA TTC GTG GGT GTC GAG ATC GAG ACC GAC CGC 480 

Thr Val Gly Phe Gin Lys Phe Val Gly Va 1 Glu He Glu Thr Asp Arg 
145 150 155 460 

CCC CAC GGC GTG CCC CGC CCG ATG ATC ATG GAC GCG ACC GTC ACC' CAG 528 

Pro His Gly Val Pro Arg Pro Met He Met Asp Ala Thr Val Thr Gin 

165 170 175 

CAG GAC GGG TAC CGC TTC ATC TAT CTG CTG CCC TTC TCT CCG ACG CGC 576 

50 Gin Asp Gly Tyr Arg Phe lie Tyr Leu Leu Pro Phe Ser Pro Thr Arg 

180 185 190 

55 
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ATC CTG ATC GAG GAC ACG CGC TAT TCC GAT CGC GGC GAT CTG GAC GAC S24 
He Leu He Glu Asp Thr Arg Ty r Set Asp Gly Gly Asp Leo Asp Asp 

195 200 205 

GAC GCG CTG GCG GCG GCG TCC CAC GAC TAT GCC CGC CAG CAG GGC TGG 612 
Asp Ala Lea Ah Ala Ala Ser His Asp Tyr Ala Arg Gin Gin Gly Trp 

210 215 220 

ACC GGG GCC GAG GTC CGG CGC GAA CGC GGC ATC CTT CCC ATC GCG CTG 720 
Thr Gly Ala Glu Val Arg Arg Glu Arg Gly lie Leu Pro lie Ala Leu 
225 230 235 240 

GCC CAT CAT GCG GCG GGC TTC TGG GCC GAT CAC GCG GCG GGG CCT GTT 768 
Ala His Asp Ala Ala Gly Phe Trp Ala Asp His Ala Ala Gly Pro Val 

245 250 255 

CCC GTG GGA CTG CGC GCG GGG TTC TTT CAT CCG GTC ACC GGC TAT TCG 816 
Pro Val Gly Leu Arg Ala Gly Phe Phe His Pro Val Thr Gly Tyr Ser 

260 265 270 

CTG CCC TAT GCG GCA CAG GTG GCG GAC GTG GTG GCG GGT CTG TCC GGG 864 
Leu Pro Tyr Ala Ala Gin Val Ala Asp Val Val Ala Gly Leu Ser Gly 

275 280 285 

CCG CCC GGC ACC GAC GCG CTG CGC GGC GCC ATC CGC GAT TAC GCG ATC S12 
Pro Pro Gly Thr Asp Ala Leu Arg Gly Ala lie Arg Asp Tyr Ala lie 

290 295 300 

GAC CGG GCG CGC CCC GAC CGC TTT CTG CCC CTT TTG AAC CGG ATC CTG 960 
50 Asp Arg Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Mel Leu 

305 310 315 320 
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TTC CGC GGC TGC GCG CCC GAC CGG CGC TAT ACC CTG CTG CAG CGG TTC 1008 
Phe Arg Gly Cjs Ala Pro Asp Arg Ar g Tyr Thr Leo Leu Gin Arg Phe 

325 330 335 

TAC CGC ATG CCG CAT GGA CTG ATC GAA CGG TTC TAT CCC GGC CGG CTG 1056 
Tyr Arg Met Pro His Gly Leu He Gin Arg Phe Tyi Ala Gly Arg Leu 

340 345 350 

AGC GTG GCG GAT CAG CTG CGC ATC GTG ACC GGC AAG CCT CCC ATT CCC 1104 
Ser Yal Ala Asp Gin Leu Arg lie Val Thr Gly Lys Pro Pro He Pro 

355 360 365 

CTT GGC ACG GCC ATC CGC TGC CTG CCC GAA CGT CCC CTG CTG AAG GAA 1152 
Leu Gly Thr Ala He Arg Cys Leu Pro Glu Arg Pro Leu Leu Lys Glu 

370 375 380 

AAC GCA TGA H61 
Asn Ala *** 
385 
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SEQ ID NO: 4 

SEQUENCE LENGTH: 2886 

SEQUENCE TYPE: STRANDEDNESS: double 
TOPOLOGY : 1 inear 
MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: Agrobacterium aurantiacus 

STRAIN: sp. nov. MK1 

SEQUENCE 

GGATCCGGCG ACCTTGCGGC GCTGCGCCGC GCGCCTTTGC TGGTGCCTGG GCCGGGTGGC 
CCTAGGCCGC TGGAACGCCG CGACGCGGCG CGCGGAAACG ACCACGGACC CGGCCCACCG 

CAATGGTCGC AAGCAACGGG GATGGAAACC GGCGATGCGG GACTGTAGTC TGCGCGGATC 
GTTACCAGCG TTCGTTGCCC CTACCTTTGG CCGCTACGCC CTGACATCAG ACGCGCCTAG 

GCCGGTCCGG GGGACAAGAT GAGCGCACAT GCCCTGCCCA AGGCAGATCT GACCGCCACC 
CGGCCAGGCC CCCTGTTCTA CTCGCGTGTA CGGGACGGGT TCCGTCTAGA CTGGCGGTGG 

AGCCTGATCG TCTCGGGCGG CATCATCGCC GCTTGGCTGG CCCTGCATGT GCATGCGCTG 
TCGGACTAGC AGAGCCCGCC GTAGTAGCGG CGAACCGACC GGGACGTACA CGTACGCGAC 

TGGTTTCTGG ACGCAGCGGC GCATCCCATC CTGGCGATCG CAAATTTCCT GGGGCTGACC 
ACCAAAGACC TGCGTCGCCG CGTAGGGTAG GACCGCTAGC GTTTAAAGGA CCCCGACTGG 
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TGGCTGTCGG TCGGATTGTT CATCATCGCG CATGACGCGA TGCACGGGTC GGTGGTGCCG 360 
ACCGACAGCC AGCCTAACAA GTAGTAGCGC GTACTGCGCT ACGTGCCCAG CCACCACGGC 

GGGCGTCCGC GCGCCAATGC GGCGATGGGC CAGCTTGTCC TGTGGCTGTA TGCCGGATTT 420 
CCCGCAGGCG CGCGGTTACG CCGCTACCCG GTCGAACAGG ACACCGACAT ACGGCCTAAA 

TCGTGGCCCA AGATGATCGT CAAGCACATG GCCCATCACC GCCATGCCGG AACCGACGAC 480 
AGCACCGCGT TCTACTAGCA GTTCGTGTAC CGGGTAGTGG CGGTACGGCC TTGGCTGCTG 

GACCCCGATT TCGACCATGG CGGCCCGGTC CGCTGGTACG CCCGCTTCAT CGGCACCTAT 540 
CTGGGGCTAA AGCTGGTACC GCCGGGCCAC GCGACCATGC GGGCGAAGTA GCCGTGGATA 

TTCGGCTGGC GCGAGGGGCT GCTGCTGCCC GTCATCGTGA CGGTCTATGC GCTGATCCTT 600 
AAGCCGACCG CGCTCCCCGA CGACGACGGG CAGTAGCACT GCCAGATACG CGACTaGGAA 

GGGGATCGCT GGATGTACGT GGTCTTCTGG CCGCTGCCGT CGATCCTGGC GTCGATCCAG 660 
CCCCTAGCGA CCTACATGCA CCAGAAGACC GGCGACGGCA GCTAGGACCG CAGCTAGGTC 

CTGTTCGTGT TCGGCACCTG GCTGCCGCAC CGCCCCGGCC ACGACGCGTT CCCGGACCGC 720 
GACAAGCACA AGCCGTGGAC CGACGGCGTG GCGGGGCCGG TGCTGCGCAA GGGCCTGGCG 

CACAATGCGC GGTCGTCGCG GATCAGCGAC CCCGTGTCGC TGCTGACCTG CTTTCACTTT 780 
GTGTTACGCG CCAGCAGCGC CTAGTCGCTG GGGCACAGCG ACGACTGGAC GAAAGTGAAA 
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GGCCGTTATC ATCACGAACA CCACCTGCAC CCGACCGTGC CGTGGTGGCG CCTGCCCAGC 840 
CCGCCAATAG TAGTGCTTGT GGTGGACGTG GGCTGCCACG GCACCACCGC GGACGGGTCG 

ACCCGCACCA AGGGGGACAC CGCATGACCA ATTTCCTGAT CGTCGTCGCC ACCGTGCTGG 900 
TGGGCGTGGT TCCCCCTGTG GCG7ACTGGT TAAAGGACTA GCAGCAGCGG TGGCACGACC 

TGATGGAGTT GACGGCCTAT TCCGTCCACC GCTGGATCAT GCACGGCCCC CTGGGCTGGG 960 
ACTACCTCAA CTGCCGGATA AGGCAGGTGG CGACCTAGTA CGTGCCGGGG GACCCGACCC 

GCTGGCACAA GTCCCACCAC GAGGAACACG ACCACGCGCT GGAAAAGAAC GACCTGTACG 1020 
CGACCGTGTT CAGGGTGGTG CTCCTTGTGC TGGTGCGCGA CCTTTTCTTG CTGGACATGC 

GCCTGGTCTT TGCGGTGATC GCCACGGTGC TGTTCACGGT GGGCTGGATC TGGGCGCCGG 1 0 80 
CCGACCAGAA ACGCCACTAG CGGTGCCACG ACAAGTGCCA CCCGACCTAG ACCCGCGGCC 

TCCTGTGGTG GATCGCCTTG GGCATGACTG TCTATGGGCT GATCTATTTC GTCCTGCATC l l 40 
AGGACACCAC CTAGCGGAAC CCGTACTGAC AGATACCCGA CTAGATAAAC CAGGACGTAC 

ACGGGCTGGT GCATCAGCGC TGGCCGTTCC GTTATATCCC GCGCAAGGGC TATGCCAGAC 1200 
TGCCCGACCA CGTAGTCGCG ACCGGCAAGG CAATATAGGG CGCGTTCCCG ATACGGTCTG 

GCCTGTATCA GGCCCACCGC CTGCACCATG CGGTCGAGGG GCGCGACCAT TGCGTCAGCT 1260 
CGGACATAGT CCGGGTCGCG GACGTGGTAC GCCAGCTCCC CGCGCTGGTA ACGCAGTCGA 
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TCGGCTTCAT CTATGCGCCC CCGGTCGACA AGCTGAAGCA GGACCTGAAG ATGTCGGGCG 1320 

AGCCGAAGTA GATACGCGGG GGCCAGCTGT TCGACTTCGT CCTGGACTTC TACAGCCCGC 

TGCTGCGGGC CGAGGCGCAG GAGCGCACGT GACCCATGAC GTGCTGCTGG CAGGGGCGGG 1380 

ACGACGCCCG GCTCCGCGTC CTCGCGTG'CA CTGGGTACTG CACGACGACC GTCCCCGCCC 

CCTTGCCAAC GGGCTGATCG CCCTGGCGCT GCGCGCGGCG CGGCCCGACC TGCGCGTGCT 1440 

GGAACGGTTG CCCGACTAGC GGGACCGCGA CGCGCGCCGC GCCGGGCTGG ACGCGCACGA 

GCTGCTGGAC CATGCCGCAG GACCGTCAGA CGGCCACACC TGGTCCTGCC ACGACCCCGA 1500 

CCACGACCTG GTACGGCCTC CTGGCAGTCT GCCGGTGTGG ACCAGGACGG TGCTGGGGCT 

CCTGTCGCCG CACTGGCTGG CGCGGCTGAA GCCCCTGCGC CGCGCCAACT GGCCCGACCA 1560 

GGACAGCGGC CTGACCGACC GCGCCGACTT CGGGGACGCG GCGCGGTTGA CCGGGCTGGT 

GGAGGTGCGC TTTCCCCGCC ATGCCGGGCG GCTGGCCACC GGTTACGGGT CGCTGGACGG 1620 

CCTCCACGCG AAAGGGGCGG TACGGGCCGC CGACCGCTGG CCAATGCCCA GCGACCTGCC 

GGCGGCGCTG GCGGATGCGG TGGTCCGGTC GGGCGCCGAG ATCCGCTGGG ACAGCGACAT 1680 

CCGCCGCGAC CGCCTACGCC ACCAGGCCAG CCCGCGGCTC TAGGCGACCC TGTCGCTGTA 

CGCCCTGCTG GATGCGCAGG GGGCGACGCT GTCCTGCGGC ACCCGGATCG AGGCGGGCGC 1740 

GCGGGACGAC CTACGCGTCC CCCGCTGCGA CAGGACGCCG TGGGCCTAGC TCCGCCCGCC 
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GGTCCTGGAC GGGCGGGGCG CGCAGCCGTC GCGGCATCTG ACCGTGGCTT TCCAGAAATT 1800 

CCAGGACCTG CCCGCCCCGC GCGTCGGCAG CGCCGTAGAC TGGCACCCAA AGGTCTTTAA 

GAGATCGAGA CCGACCGCCC CCACGGCGTG CCCCGCCCGA TGATCATGGA I860 

CTCTAGCTCT GGCTGGCGGG GGTCCCGCAC GGGGCGGGCT ACTAGTACCT 

CGCGACCGTC ACCCAGCAGG ACGGGTACCG CTTCATCTAT CTGCTGCCCT TCTCTCCGAC 1920 

GCGCTGGCAG TGGGTCGTCC TGCCCATGGC GAAGTAGATA GACGACGGGA AGAGAGGCTG 

GCGCATCCTG ATCGAGGACA CGCGCTATTC CGATGGCGGC GATCTGGACG ACGACGCGCT 1 9 80 

CGCGTAGGAC TAGCTCCTGT GCGCGATAAG GCTACCGCCG CTAGACCTGC TGCTGCGCGA 

GGCGGCGCCG TCCCACGACT ATGCCCGCCA GCAGGGCTGG ACCGGGGCCG AGGTCCGGCG 2040 

CCGCCGCCGC AGGGTGCTGA TACGGGCGGT CGTCCCGACC TGGCCCCGGC TCCAGGCCGC 

CGAACGCGGC ATCCTTCCCA TCGCGCTGGC CCATGATGCG GCGGGCTTCT GGGCCGATCA 2100 

GCTTGCGCCG TAGGAAGGGT AGCGCGACCG GGTACTACGC CGCCCGAAGA CCCGGCTAGT 

CGCGGCGGCG CCTGTTCCCG TGGGACTGCG CGCGGGGTTC TTTCATCCGG TCACCGGCTA 2160 

GCGCCGCCCC GGACAAGGGC ACCCTGACGC GCGCCCCAAG AAAGTAGGCC AGTGGCCGAT 

TTCGCTGCCC TATGCGGCAC AGGTGGCGGA CGTGGTGGCG GGTCTGTCCG GGCCGCCCGG 2220 

AAGCGACGGG ATACGCCGTG TCCACCGCCT GCACCACCGC CCAGACAGGC CCGGCGGGCC 



CGTGGGTGTC 
GCACCCACAG 
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CACCGACGCG CTGCGCGGCG CCATCCGCGA TTACGCGATC GACCGGGCGC 

5 GTGGCTGCGC GACGCGCCGC GGTAGGCGCT AATGCGCTAG CTGGCCCGCG 

CTTTCTGCGC CTTTTGAACC GGATCCTGTT CCGCGGCTGC GCGCCCGACC GGCGCTATAC 2340 

10 

GAAAGACGCG GAAAACTTGG CCTACGACAA GGCGCCGACG CGCGGGCTGG CCGCGATATG 

CCTGCTGCAG CGGTTCTACC GCATGCCGCA TGGACTGATC GAACGGTTCT ATGCCGGCCG 2400 

GGACGACGTC GCCAAGATGG CGTACGCCGT ACCTGACTAG CTTGCCAAGA TACGGCCGGC 

20 

GCTGAGCGTG GCGGATCAGC TGCGCATCGT GACCGGCAAG CCTCCCATTC CCCTTGGCAC 2460 

CGACTCGCAC CGCCTAGTCG ACGCGTAGCA CTGGCCGTTC GGAGGGTAAG GGGAACCGTG 

GGCCATCCGC TGCCTGCCCG AACGTCCCCT GCTGAAGGAA AACGCATGAA CGCCCATTCG 2520 

CCGGTAGGCG ACGGACGGGC TTGCAGGGGA CGACTTCCTT TTGCGTACTT GCGGGTAAGC 

35 

CCCGCGGCCA AGACCGCCAT CGTGATCGGC GCAGGCTTTG GCGGGCTGGC CCTGGCCATC 2580 

GGGCGCCGGT TCTGGCGGTA GCACTAGCCG CGTCCGAAAC CGCCCGACCG GGACCGGTAG 

40 

CGCCTGCAGT CCGCGGGCAT CGCCACCACC CTGGTCGAGG CCCGGGACAA GCCCGGCGGG 2640 

GCGGACGTCA GGCGCCCGTA GCGGTGGTGG GACCAGCTCC GGGCCCTGTT CGGGCCGCCC 

CGCGCCTATG TCTGGCACGA TCAGGGCCAT CTCTTCGACG CGGGCCCGAC CGTCATCACC 2700 

GCGCGGATAC AGACCGTGCT AGTCCCGGTA GAGAAGCTGC GCCCGGGCTG GCAGTAGTGG 




GCCGCGACCG 2280 
CGGCGCTGGC 
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GACCCCGATG. CGCTGAAAGA GCTGTGGGCC CTGACCGGGC AGGACATGGC GCGCGACGTG 2760 
CTGGGGCTAC GCGACTTTCT CGACACCCGG GACTGGCCCG TCCTGTACCG CGCGCTGCAC 

ACGCTGATGC CGGTCTCGCC CTTCTATCGG CTGATGTGGC CGGGCGGGAA GGTCTTCGAT 2820 
TGCGACTACG GCCAGAGCGG GAAGATAGCC GACTACACCG GCCCGCCCTT CCAGAAGCTA 

TACGTGAACG AGGCCGATGC AGGGTCTGGG TCTTGCCGTG CCAGGTGAAG CTGTTGCCGT 2880 
ATGCACTTGC T.CCGGCTAGG TCCCAGACCC AGAACGGCAC GGTCCACTTC GACAACGGCA 



GGATCC 2886 
CCTAGG 
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SEQ ID NO: 5 

SEQUENCE LENGTH: 729 

SEQUENCE TYPE: STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: Alcaligenes 

STRAIN: sp. PC-1 

SEQUENCE 



ATG TCC GGA CGG AAG CCT GGC ACA ACT GGC GAC ACG ATC GTC AAT CTC 48 

Met Ser Gly Arg Lys Pro Gly Thr Thr Gly Asp Thr He Yal Asn Leu 

15 10 15 

GGT CTG ACC GCC GCG ATC CTG CTG TGC TGG CTG GTC CTG CAC GCC TTT 96 

30 Giy Leu Thr Ala Ala lie Leu Leu Cy.s Trp Leu Yal Leu His Ala Phe 

20 25 30 

35 ACG CTA TGG TTG CTA GAT GCG GCC GCG CAT CCG CTG CTT GCC GTG CTG 144 
Thr Leu Trp Leu Leu Asp Ah Ala Ala His Pro Leu Leu Ala Yal Leu 

35 " 40 45 

TGC CTG GCT GGG CTG ACC TGG CTG TCG GTC GGG CTG TTC ATC ATC GCG 192 

Cys Leu Ala Gly Leu Thr Trp Leu Ser Yal Gly Leu Phe lie lie Ala 

50 55 60 
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so 



25 



30 



35 



40 



45 



50 



CAT GAC 
His Asp 
65 

GCG GCG 
Ala Ala 

CCC A AG 
Pro Lys 

GAC AAC 
Asp As n 

AGC TTC 
Set Phe 
130 
GTG ATC 
Val lie 
145 

GTC ATC 
Val He 

GTC TTC 
Val Phe 



GCA ATG 
Ala Met 

ATC GGG 
He G!y 

CTG ATC 

Lei! He 
100 

GAT CCC 

Asp Pro 
115 

GTC TCC 

Val Ser 

GTC ACC 
Val Th r 

TTC TCG 
Phe Trp 

GGA ACT 
Glj Th r 
180 



CAC GGG 
His Gly 
10 

CAA CTG 
Gin Leu 
85 

GCC AAG 
Ala Lys 

GAT TTC 

As? Phe 

ACC TAT 
Thr Tyr 

ACC TAT 
Thr Tyr 
150 
CCG GTC 
Pro Val 
165 

TGG CTG 
Trp Leo 



TCC GTG GTG 
Ser Val Val 

GCG CTG TGG 
Ala Leu Trp 

CAC ATG ACG 
His Mel Thr 
105 

GGT CAC GGA 
Gly His Gly 
120 

TTC GGC TGG 
Phe Gly Trp 
135 

GCG CTG ATC 
Ala Leu lie 

CCG GCC GTT 
Pro Ala Yal 

CCC CAC CGC 
Pro His Arg 
185 



CCG GGG 
Pro Gly 
75 

CTC TAT 
Leo Tyr 
90 

CAT CAC 
His His 

GGG CCC 
Gly Pro 

CGA GAG 
Arg GIu 

CTG GGC 
Leu Gly 
155 
CTG GCG 
Leu Ala 
170 

CCG GGA 
Pro Gly 



CGG CCG CGC GCC AAT 
Arg Pro Arg Ala Asn 



240 



GCG GGG TTC 
Ala Gly Phe 

CGG CAC GCC 
Arg His Ala 
110 

GTG CGC TGG 
Val Arg Trp 
125 

GGA CTG CTG 
Gly Leu Leu 
140 

GAT CGC TGG 
Asp Arg Trp 

TCG ATC CAG 
Ser He Gin 

CAT GAC GAT 
His Asp Asp 
190 



TCG TGG 
Ser Trp 
95 

GGC ACC 
Gly Thr 

TAC GGC 
Tyr Gly 

CTA CCG 
Leu Pro 

ATG TAT 
Met Tyr 
160 
ATT TTC 
He Phe 
175 

TTT CCC 
Phe Pro 
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GAC CGG CAC AAC GCG AGG TCG ACC GGC ATC GGC GAC CCG TTG TCA CTA 624 

Asp Arg His Asn Ala Arg Ser Tin Gly He Gly Asp Pro Leu Ser Leu 

195 200 205 

CTG ACC TGC TTC CAT TTC GGC GGC TAT CAC CAC GAA CAT CAC CTG CAT 672 

Leu Thr Cys Phe His Pbe Gly Gly Tyr His His G I a His His Leu His 

210 215 220 

CCG CAT GTG CCG TGG TGG CGC CTG CCT CGT ACA CGC AAG ACC GGA GGC '720 

Pro His YaJ Pro Trp T r p Arg Leu Pro Arg Thr Arg Lys Thr Gly Gly 
225 230 235 240 

CGC GCA TGA 729 
Arg Alan* 
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SEQ ID NO: 6 

SEQUENCE LENGTH: 489 

SEQUENCE TYPE: STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: Alcaligenes 

STRAIN: sp. PC-1 



SEQUENCE 

ATG ACG CAA'TTC CTC ATT GTC GTG GCG ACA GTC CTC GTG ATG GAG CTG 48 

Met Th r Gin Phe Leu He Yal Ya I Ala Thr Ya! Leu Ya I Met Glu Leu 

5 10 15 

ACC GCC TAT TCC GTC CAC CGC TGG ATT. ATG CAC GGC CCC CTA GGC TGG 96 

30 Thr Ala Tyr Ser Y2I His Arg Tip IU Met His Gly Pro Leu Gly Trp 

20 25 30 

GGC TGG CAC AAG TCC CAT CAC GAA GAG CAC GAC CAC GCG TTG GAG AAG 144 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Giu Lys 

35 40 45 

AAC GAC CTC TAC GGC GTC GTC TTC GCG GTG CTG GCG ACG ATC CTC TTC 192 

Asn Asp Leu Tyr Gly Yal Yal Phe Ala Yal Leu Ala Thr lie Leu Phe 

50 55. 60 
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ACC GTG GGC GCC TAT TGG TGG CCG GTG CTG TGG TGG ATC GCC CTG GGC 
Thr Yal Gly Ala Tyr Trp Trp Pro Yal Leu Trp Trp lie Ala Leu Gly 
65 70 75 80 

ATG ACG GTC TAT GGG TTG ATC TAT TTC ATC CTG CAC G AC GGG CTT GTG 
Met Tbr Yal Tyr Gly Leu lit Tyr Phe lie Leu His Asp Gly Leu Yal 

85 90 95 

CAT CAA CGC TGG CCG TTT CGG TAT ATT CCG CGG CGG GGC TAT TTC CGC 
His Gin Arg Trp Pro Pbe Arg Tyr He Pro Ar g Arg Gly Tyr Phe Arg 

100 105 HO 

AGG CTC TAC CAA GCT CAT CGC CTG CAC CAC GCG GTC GAG GGG CGG CAC 
Arg Leu Tyr Gin Ala His Arg Leu His His Ala Yal Glu Gly Arg Asp 

115 120 125 

CAC TGC GTC AGC TTC GGC TTC ATC TAT GCC CCA CCC GTG GAC AAG CTG 
His Cys Yal Ser Phe Gly Phe He Tyr Ala Pro Pro Yal Asp Lys Leu 

130 ' 135 HO 

AAG CAG GAT CTG AAG CGG TCG GGT GTC CTG CGC CCC CAG GAC GAG CGT 
Lys Gin Asp Leu Lys Arg Ser Gly Yal Leu Arg Pro Gin Asp Glu Arg 
145 150 155 160 

CCG TCG TGA 
Pro Ser *** 
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SEQ ID NO: 7 

SEQUENCE LENGTH: 1631 

SEQUENCE TYPE: STRANDEDNESS : 



double 



TOPOLOGY: 



linear 



MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE: 



ORGANISM: 



Alcaligenes 



STRAIN: 



sp. PC-1 



SEQUENCE 



CTGCAGGCCG GGCCCGGTGG CCAATGGTCG CAACCGGCAG GACTGGAACA GGACGGCGGG 60 

GACGTCCGGC CCGGGCCACC GGTTACCAGC. GTTGGCCGTC CTGACCTTGT CCTGCCGCCC 

CCGGTCTAGG CTGTCGCCCT ACGCAGCAGG AGTTTCGGAT GTCCGGACGG AAGCCTGGCA 120 

GGCCAGATCC GACAGCGGGA TGCGTCGTCC TGAAAGCCTA CAGGCCTGCC TTCGGACCGT 

CAACTGGCGA CACGATCGTC AATCTCGGTC TCACCGCCGC GATCCTGCTG TGCTGGCTGG 180 

GTTGACCGCT GTGCTAGCAG TTAGAGCCAG ACTGGCGGCG CTAGGACGAC ACGACCGACC 

TCCTGCACGC CTTTACGCTA TGGTTGCTAG ATGCGGCCGC GCATCCGCTG CTTGCCGTGC 240 

AGGACGTGCG GAAATGCGAT ACCAACGATC TACGCCGGCG CGTAGGCGAC GAACGGCACG 
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TGTCCCTGGC TGGGCTGACC TGGCTGTCGG TCGGGCTGTT CATCATCGCG 

ACACGGACCG ACCCGACTGG ACCGACAGCC AGCCCGACAA GTAGTAGCGC 

TGCACGCGTC CGTGGTGCCG GGGCGGCCGC GCGCCAATGC GGCGATCGGG CAACTGGCGC 360 

ACGTGCCCAG GCACCACGGC CCCGCCGGCG CGCGGTTACG CCGCTAGCCC GTTGACCGCG 

TGTGGCTCTA TGCGGGGTTC TCGTGGCCCA AGCTGATCGC CAAGCACATG ACGCATCACC 420 

ACACCGAGAT ACGCCCCAAG AGCACCGGGT TCGACTAGCG GTTCGTGTAC TGCGTAGTGG 

GGCACGCCGG CACCGACAAC GATCCCGATT TCGGTCACGG AGGGCCCGTG CGCTGGTACG 480 

CCGTGCGGCC GTGGCTGTTG CTAGGGCTAA AGCCAGTGCC TCCCGGGCAC GCGACCATGC 

GCAGCTTCGT CTCCACCTAT TTCGGCTGGC GAGAGGGACT GCTGCTACCG GTGATCGTCA 540 

CGTCGAAGCA GAGGTGGATA AAGCCGACCG CTCTCCCTGA CGACGATGGC CACTAGCAGT 

CCACCTATGC GCTGATCCTG GGCGATCGCT GGATGTATGT CATCTTCTGG CCGGTCCCGG 600 

GGTGGATACG CGACTAGGAC CCGCTAGCGA CCTACATACA GTAGAAGACC GGCCAGGGCC 

CCGTTCTGGC GTCGATCCAG ATTTTCGTCT TCGGAACTTG GCTGCCCCAC CGCCCGGGAC 660 

GGCAAGACCG CAGCTAGGTC TAAAAGCAGA AGCCTTGAAC CGACGGGGTG GCGGGCCCTG 

ATGACGATTT TCCCGACCGG CACAACGCGA GGTCGACCGG CATCGGCGAC CCGTTGTCAC 720 

TACTGCTAAA AGGGCTGGCC GTGTTGCGCT CCAGCTGGCC GTAGCCGCTG GGCAACAGTG 




CATGACGCAA 300 
GTACTGCGTT 
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TACTGACCTG CTTCCATTTC GCCCGCTATC ACCACGAACA TCACCTGCAT CCGCATG7GC 780 
ATGACTGGAC GAAGGTAAAG CCGCCGATAG TGGTGCTTGT AGTGGACGTA GGCGTACACG 

CGTGGTGGCG CCTGCCTCGT ACACGCAAGA CCGGAGGCCG CGCATGACGC AATTCCTCAT 840 
GCACCACCGC GGACGGAGCA TCTGCGTTCT GGCCTCCGGC GCGTACTGCG TTAAGGAGTA 

TGTCGTGGCG ACAGTCCTCG TGATGGAGCT GACCGCCTAT TCCGTCCACC GCTGGATTAT 900 
ACAGCACCGC TGTCAGGAGC ACTACCTCGA CTGGCGGATA AGGCAGGTGG CGACCTAATA 



GCACGGCCCC CTAGGCTGGG GCTGGCACAA GTCCCATCAC GAAGAGCACG ACCACGCGTT 960 
CGTGCCGGGG GATCCGACCC CGACCGTGTT CAGGGTAGTG CTTCTCGTGC TGGTGCGCAA 

GGAGAAGAAC GACCTCTACG GCGTCGTCTT CGCGGTGCTG GCGACGATCC TCTTCACCGT 1020 
CCTCTTCTTG CTGGAGATGC CGCAGCAGAA GCGCCACGAC CGCTGCTAGG AGAAGTGGCA 

GGGCGCCTAT TGGTGGCCGG TGCTGTGGTG GATCGCCCTG GGCATGACGG TCTATGGGTT 1080 
CCCGCGGATA ACCACCGGCC ACGACACCAC CTAGCGGGAC CCGTACTGCC AGATACCCAA 

GATCTATTTC ATCCTGCACG ACGGGCTTGT GCATCAACGC TGGCCGTTTC GGTATATTCC 1 1 40 
CTAGATAAAG TAGCACGTGC TGCCCGAACA CGTAGTTGCG ACCGGCAAAG CCATATAAGG 

GCGGCGGGGC TATTTCCGCA GGCTCTACCA AGCTCATCGC CTCCACCACG CGGTCGAGGG 1200 
CGCCGCCCCG ATAAAGGCGT CCGAGATGGT TCGAGTAGCG GACGTGGTGC GCCAGCTCCC 
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GCGGGACCAC TGCGTCAGCT TCGGCTTCAT CTATGCCCCA CCCGTGGACA AGCTGAAGCA 1 2 6 0 

CGCCCTGGTG ACGCAGTCGA ACCCGAAGTA GATACGGGGT GGGCACCTGT TCGACTTCGT 

GGATCTGAAG CGGTCGGGTG TCCTGCGCCC CCAGGACGAG CGTCCGTCGT GATCTCTGAT 1 3 20 

CCTAGACT7C GCCACCCCAC ACGACGCGGG GGTCCTGCTC GCAGGCAGCA CTAGAGACTA 

CCCGGCGTGG CCGCATGAAA TCCGACGTGC TGCTGGCAGG GGCCGGCCTT GCCAACGGAC 1380 

GGGCCGCACC GGCGTACTTT AGGCTGCACG ACGACCGTCC CCGGCCGGAA CGGTTGCCTG 

TGATCGCGCT GGCGATCCGC AAGGCGCGGC CCGACCTTCG CGTGCTGCTG CTGGACCGTG 1440 

ACTAGCGCGA CCGCTAGGCG TTCCGCGCCG GCCTGGAAGC GCACGACGAC GACCTGGCAC 

CGGCGGGCGC CTCGGACGGG CATACTTGGT CCTGCCACGA CACCGATTTG GCGCCGCACT 1500 

GCCGCCCGCG GAGCCTGCCC GTATGAACCA GCACGGTGCT GTGGCTAAAC CGCGGCGTGA 

GGCTGGACCG CCTGAAGCCG ATCAGGCGTG GCGACTGGCC CGATCAGGAG GTGCGGTTCC 1560 

CCGACCTGGC GGACTTCGGC TACTCCCCAC CGCTGACCGG GCTAGTCCTC CACGCCAAGG 

CAGACCATTC GCGAAGGCTC CGGGCCGGAT ATGGCTCGAT CGACGGGCGG GGGCTGATGC 1620 

GTCTGGTAAG CGCTTCCGAG GCCCGGCCTA TACCGAGCTA GCTGCCCGCC CCCGACTACG 

GTGCGGTGAC C 1631 
CACGCCACTG G 



EP 07351 37A1 J > 



46 



EP 0 735 137 A1 



Claims 



1 . A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
a methylene group at the 4-posttion of a p-ionone ring into a keto group. 

2 A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
the methylene group at the 4-position of the p-ionone ring into a keto group and having an amino acid sequence 
substantially of amino acid Nos. 1 - 212 which is shown in the SEQ ID NO: 1 . 

3. A DNA strand hybridizing the DNA strand according to claim 2 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 2. 

4. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
the methylene group at the 4-position of the p-ionone ring into a keto group and having an amino acid sequence 
substantially of amino acid Nos. 1 • 242 which is shown in the SEQ ID NO: 5. 

5. A DNA strand hybridizing the DNA strand according to claim 4 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 4. 

6. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
p-carotene into canthaxanthin by way of echinenone and having an amino acid sequence substantially of amino 
acid Nos. 1 - 212 which is shown in the SEQ ID NO: 1. 

7. A DNA strand hybridizing the DNA strand according to claim 6 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 6. 

8 A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
p-carotene into canthaxanthin by way of echinenone and having an amino acid sequence substantially of amino 
acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

9. A DNA strand hybridizing the DNA strand according to claim 8 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 8. 

1 0. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
the methylene group at the 4-position of the 3-hydroxy-p-ionone ring into a keto group. 

1 1 A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
the methylene group at the 4-position of the 3-hydroxy-p-ionone ring into a keto group and having an amino acid 
sequence substantially of amino acid Nos. 1-212 which is shown in the SEQ ID NO: 1 

12. A DNA strand hybridizing the DNA strand according to claim 1 1 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 11. 

1 3. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
the methylene group at the 4-position of the 3-hydroxy-p-ionone ring into a keto group and having an ammo acid 
sequence substantially of amino acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

14. A DNA strand hybridizing the DNA strand according to claim 13 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 13. 

1 5. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
zeaxanthin into astaxanthin by way of 4-ketozeaxanthin and having an amino acid sequence substantially of ammo 
acid Nos. 1 - 212 which is shown in the SEQ ID NO: 1. 

16. A DNA strand hybridizing the DNA strand according to claim 15 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 15. 
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17. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
zeaxanthin into astaxanthin by way of 4-ketozeaxanthin and having an amino acid sequence substantially of amino 
acid Nos. 1 - 242 which is shown in the SEQ ID NO: 5. 

18. A DNA strand hybridizing the DNA strand according to claim 1 7 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 17. 

19. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for adding a 
hydroxyl group to the 3-carbon of the 4-keto-p-ionone ring. 

20. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for adding a 
hydroxyl group to position 3-carbon of the 4-keto-p-ionone ring and having an amino acid sequence substantially 
of amino acid Nos. 1 - 1 62 which is shown in the SEQ ID NO: 2. 

is 21 . A DNA strand hybridizing the DNA strand according to claim 20 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 20. 

22. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for adding a 
hydroxyl group to position 3-carbon of the 4-keto-p-ionone ring and having an amino acid sequence substantially 

20 of amino acid Nos. 1 - 1 62 which is shown in the SEQ ID NO: 6. 

23. A DNA strand hybridizing the DNA strand according to claim 22 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 22. 

25 24. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
canthaxanthin into astaxanthin by way of phoenicoxanthin and having an amino acid sequence substantially of 
amino acid Nos. 1-162 which is shown in the SEQ ID NO: 2. 

25. A DNA strand hybridizing the DNA strand according to claim 24 and having a nucleotide sequence which encodes 
30 a polypeptide having an enzyme activity according to claim 24. 

26. A DNA strand having a nucleotide sequence which encodes a polypeptide having an enzyme activity for converting 
canthaxanthin into astaxanthin by way of phoenicoxanthin and having an amino acid sequence substantially of 
amino acid Nos. 1-162 which is shown in the SEQ ID NO: 6. 



35 



27. A DNA strand hybridizing the DNA strand according to claim 26 and having a nucleotide sequence which encodes 
a polypeptide having an enzyme activity according to claim 26. 

28. A process for producing a xanthophyll comprising introducing the DNA strand according to any one of claims 1 - 9 
40 into a microorganism having a p-carotene-synthesizing ability, culturing the transformed microorganism in a culture 

medium, and obtaining canthaxanthin or echinenone from the cultured cells. 

29. A process for producing a xanthophyll comprising introducing the DNA strand according to any one of claims 10 - 
18 into a microorganism having a zeaxanthin-synthesizing ability, culturing the transformed microorganism in a cul- 

45 ture medium, and obtaining astaxanthin or 4-ketozeaxanthin from the cultured cells. 

30. A process for producing a xanthophyll comprising introducing the DNA strand according to any one of claims 19 - 
27 into a microorganism having a canthaxanthin -synthesizing ability, culturing the transformed microorganism in a 
culture medium, and obtaining astaxanthin or phoenicoxanthin from the cultured cells. 

50 

31 . A process for producing a xanthophyll according to any one of claims 28-30, wherein the microorganism is a bac- 
terium or yeast. 

55 
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^ 237 246 255 264 273 282 

GTG CAT GCG CTG TGG TTT CTG GAC GCA GCG GCG CAT CCC ATC CTG GCG ATC GCA 
Met His Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro lie Leu Ala lie Ala 

29i 300 309 318 327 336 

AAT TTC CTG GGG CTG ACC TGG CTG TCG GTC GGA TTG TTC ATC ATC GCG CAT GAC 
Asn ?he Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe lie He Ala His Asp 

345 354 363 372 381 390 

GCG ATG CAC GGG TCG GTG GTG CCG GGG CGT CCG CGC GCC AAT GCG GCG ATG GGC 
Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asn Ala Ala Met Gly 

399 408 417 426 435 <<< 

CAG CTT GTC CTG TGG CTG TAT GCC GGA TTT TCG TGG CGC AAG ATG ATC GTC AAG 
Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp Arg Lys Met He Val Lys 

453 462 471 480 489 498 

CAC ATG GCC CAT CAC CGC CAT GCC GGA ACC GAC GAC GAC CCC GAT TTC GAC CAT 
His Met Ala His His Arg His Ala Gly Thr Asp Asp Asp Pro Asp Phe Asp His 

507 516 525 534 543 552 

GGC GGC CCG GTC CGC TGG TAC GCC CGC TTC ATC GGC ACC TAT TTC GGC TGG CGC 
Gly Gly Pro Val Arg Trp Tyr Ala Arg Phe He Gly Thr Tyr Phe Gly Trp Arg 

561 570 579 588 597 606 

GAG GGG CTG CTG CTG CCC GTC ATC GTG ACG GTC TAT GCG CTG ATC CTT GGG GAT 
Glu Gly Leu Leu Leu Pro Val He Val Thr Val Tyr Ala Leu He Leu Gly Asp 

615 624 633 642 651 660 

CGC TGG ATG TAC GTG GTC TTC TGG CCG CTG CCG TCG ATC CTG GCG TCG ATC CAG 
Arg Trp Met Tyr Val Val Phe Trp Pro Leu Pro Ser He Leu Ala Ser He Gin 

669 678 687 696 705 714 

CTG TTC GTG TTC GGC ACC TGG CTG CCG CAC CGC CCC GGC CAC GAC GCG TTC CCG 
Leu Phe Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 

123 732 741 750 759 7 *8 

GAC CGC CAC AAT GCG CGG TCG TCG CGG ATC AGC GAC CCC GTG TCG CTG CTG ACC 
Asp Arg His Asn Ala Arg Ser Ser Arg He Ser Asp Pro Val Ser Leu Leu Thr 

777 786 795 804 813 822 

7GC TTT CAC TTT GGC GGT TAT CAT CAC GAA CAC CAC CTG CAC CCG ACG GTG CCG 
Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His Pro Thr Val Pro 

831 840 349 858 86*7 

TGG TGG CGC CTG CCC AGC ACC CGC ACC AAG GGG GAC ACC GCA TGA 
Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp Thr Ala " w * 



. B 

F I G. I 



49 



BNSDOCID: <EP 0735137A1_I_> 



EP 0 735 137 A1 



f 872 88! 890 899 908 917 

ATG ACC AAT TTC CTG ATC GTC GTC GCC ACC GTG CTG GTG ATG GAG TTG ACG GCC 
Met Thr Asn Phe Leu He Val Val Ala Thr val Leu val Met Glu Leu Thr Ala 

926 935 944 953 962 971 

TAT TCC GTC CAC CGC TGG ATC ATG CAC GGC CCC CTG GGC TGG GGC TGG CAC AAG 
Tyr Ser Val His Arg Trp He Met His Gly Pro Leu Gly Trp Gly Trp His Lys 

980 989 998 1007 1016 1025 

TCC CAC CAC GAG GAA CAC GAC CAC GCG CTG GAA AAG A AC GAC CTG TAC GGC CTG 
Ser His His Glu Glu His Asp His Ala Leu Glu Lys Asn Asp Leu Tyr Gly Leu 

1034 1043 1052 10S1 1070 1079 

GTC TTT GCG GTG ATC GCC ACG GTG CTG TTC ACG GTG GGC TGG ATC TGG GCG CCG 
val Phe Ala val He Ala Thr Val Leu Phe Thr val Gly Trp He Trp Ala Pro 

1088 1097 1106 HIS H24 H33 

GTC CTG TGG TGG ATC GCC TTG GGC ATG ACT GTC TAT GGG CTG ATC TAT TTC GTC 
Val Leu Trp Trp He Ala Leu Gly Met Thr Val Tyr Gly Leu He Tyr Phe val 

1142 1151 1160 1169 in8 1187 

CTG CAT GAC GGG CTG GTG CAT CAG CGC TGG CCG TTC CGT TAT ATC CCG CGC AAG 
Leu His Asp Gly Leu Val His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Lys 

1196 1205 1214 1223 1232 M 

GGC TAT GCC AGA CGC CTG TAT CAG GCC CAC CGC CTG CAC CAT GCG GTC GAG GGG 
Gly Tyr Ala Arg Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly 

1250 1259 1268 1277 1286 1295 

CGC GAC CAT TGC GTC AGC TTC GGC TTC ATC TAT GCG CCC CCG GTC GAC AAG CTG 
Arg Asp His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 

1304 1313 1322 1331 1340 1349 

AAG CAG GAC CTG AAG ATG TCG GGC GTG CTG CGG GCC GAG GCG CAG GAG CGC ACG 
Lys Gin Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gin Glu Arg Thr 

TGA I 

::: d 



FIG. 2 



BNSDOCID: <EP 07351 37 A1_l_> 



50 



EP 0 735 137 A1 



E 

Nf 1357 1366 1375 1384 1393 1402 

GTG ACC CAT GAC GTG CTG CTG GCA GGG GCG GGC CTT GCC AAC GGG CTG ATC GCC 

Met Thr Kis Asp Val Leu Leu Ala Gly Ala Gly Leu Ala Asn Gly Leu He Ala 

1411 1420 1429 1433 1447 1456 

CTG GCG CTG CGC GCG GCG CGG CCC GAC CTG CGC GTG CTG CTG CTG GAC CAT GCC 
Leu Ala Leu Arg Ala Ala Arg Pro Asp Leu Arg Val Leu Leu Leu Asp His Ala 

.1465 1474 1483 1492 1501 1510 

GCA GGA CCG TCA GAC GGC CAC ACC TGG TCC TGC CAC GAC CCC GAC CTG TCG CCG 
Ala Gly Pro Ser Asp Gly His Thr Trp Ser Cys His Asp Pro Asp Leu Ser Pro 

1519 1528 1537 1546 1555 1564 

GAC TGG CTG GCG CGG CTG AAG CCC CTG CGC CGC GCC AAC TGG CCC GAC CAG GAG 
Asp Trp Leu Ala Arg Leu Lys Pro Leu Arg Arg Ala Asn Trp Pro Asp Gin Glu 

15-73 1582 1591 1600 1609 1618 

GTG CGC TTT CCC CGC CAT GCC CGG CGG CTG GCC ACC GGT TAC GGG TCG CTG GAC 
val Arg Phe Pro Arg His Ala Arg Arg Leu Ala Thr Gly Tyr Gly Ser Leu Asp 

1627 1536 1645 1654 1663 1672 

GGG GCG GCG CTG GCG GAT GCG GTG GTC CGG TCG GGC GCC GAG ATC CGC TGG GAC 
Gly Ala Ala Leu Ala Asp Ala Val Val Arg Ser Gly Ala Glu He Arg Trp A»sp 

1681 1690 1699 1708 1717 1726 

AQC GAC ATC GCC CTG CTG GAT GCG CAG GGG GCG ACG CTG TCC TGC GGC ACC CGG 
Ser Asp He Ala Leu Leu Asp Ala Gin Gly Ala Thr Leu Ser Cys Gly Thr Arg 

1735 174.4 1753 1762 1771 1780 

ATC GAG GCG GGC GCG GTC CTG GAC GGG CGG GGC GCG CAG CCG TCG CGG CAT CTG 
He Glu Ala Gly Ala Val Leu Asp Gly Arg Gly Ala Gin Pro Ser Arg His Leu 

1789 1793 1807 1816 1825 1834 

ACC GTG GGT TTC CAG AAA TTC GTG GGT GTC GAG ATC GAG ACC GAC CGC CCC CAC 
Thr Val Gly Phe Gin Lys Phe Val Gly Val Glu He Glu Thr Asp Arg Pro His 

1843 1852 1861 1870 1879 1888 

GGC GTG CCC CGC CCG ATG ATC ATG GAC GCG ACC GTC ACC CAG CAG GAC GGG TAC 
Gly Val Pro Arg Pro Met He Met Asp Ala Thr Val Thr Gin Gin Asp Gly Tyr 

1897 1906 1S15 1524 1933 1942 

CGC TTC ATC TAT CTG CTG CCC TTC TCT CCG ACG CGC ATC CTG ATC GAG GAC ACG 
Arg Phe He Tyr Leu Leu Pro Phe Ser Pro Thr Arg He Leu He Glu Asp Thr 

1951 1960 1969 1578 1987 1996 

CGC TAT TCC GAT GGC GGC GAT CTG GAC GAC GAC GCG CTG GCG GCG GCG TCC CAC 
Arg Tyr Ser Asp Gly Gly Asp Leu Asp Asp Asp Ala Leu Ala Ala Ala Ser His 
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2005 2014 2023 

GAC TAT GCC CGC CAG CAG GGC TGG ACC 
Asp Tyr Ala Arg Gin Gin Gly Trp Thr 

2059 2068 2077 

ATC CTT CCC ATC GCG CTG GCC CAT GAT 
lie Leu Pro He Ala Leu Ala His Asp 

2113 2122 2131 

GCG GGG CCT GTT CCC GTG GGA CTG CGC 
Ala Gly Pro Val Pro Val Gly Leu Arg 



2032 2041 2050 

GGG GCC GAG GTC CGG CGC GAA CGC GGC 
Gly Ala Glu Val Arg Arg Glu Arg Gly 

2086 2095 2104 

GCG GCG GGC TTC TGG GCC GAT CAC GCG 
Ala Ala Gly ?he Trp Ala Asp His Ala 

2140 2149 2158 

GCG GGG TTC TTT CAT CCG GTC ACC GGC 
Ala Gly Phe Phe His Pro val Thr Gly 



2167 2176 
TAT TCG CTG CCC TAT GCG 
Tyr Ser Leu Pro Tyr Ala 

2221 2230 
CCG CCC GGC ACC GAC GCG 
Pro Pro Gly Thr Asp Ala 

2275 2284 
GCG CGC CGC GAC CGC TTT 
Ala Arg Arg Asp Arg Phe 

2329 2338 
GCG CCC GAC CGG CGC TAT 
Ala Pro Asp Arg Arg Tyr 

2383 2392 
CTG ATC GAA CGG TTC TAT 
Leu He *Glu Arg Phe Tyr 

2437 2446 
GTG ACC GGC AAG CCT CCC 
Val Thr Gly Lys Pro Pro 

2491 2500 
CGT CCC CTG CTG AAG GAA 
Arg Pro Leu Leu Lys Glu 



2185 2194 
GCA CAG GTG GCG GAC GTG 
Ala Gin Val Ala Asp Val 

2239 2248 
CTG CGC GGC GCC ATC CGC 
Leu Arg Gly Ala He Arg 

2293 2302 
CTG CGC CTT TTG AAC CGG 
Leu Arg Leu Leu Asn Arg 

2347 2356 
ACC CTG CTG CAG CGG TTC 
Thr Leu Leu Gin Arg Phe 

2401 2410 
GCC GGC CGG CTG AGC GTG 
Ala Gly Arg Leu Ser Val 

2455 2464 
ATT CCC CTT GGC ACG GCC 
lie Pro Leu Gly Thr Ala 

2509 
AAC GCA TGA 
Asn Ala *** 

F 



2203 2212 
GTG GCG GGT CTG TCC GGG 
Val Ala Gly Leu Ser Gly 

225*7 2266 
GAT TAC GCG ATC GAC CGG 
Asp Tyr Ala He Asp Arg 

2311 2320 
ATG CTG TTC CGC GGC TGC 
Met Leu Phe Arg Gly Cys 

2365 2374 
TAC CGC ATG CCG CAT GGA 
Tyr Arg Met Pro His Gly 

2419 2428 
GCG GAT CAG CTG CGC ATC 
Ala Asp Gin Leu Arg He 

2473 2482 
ATC CGC TGC CTG CCC GAA 
He Arg Cys Leu Pro Glu 
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10 20 30 40 50 60 

■ - 

GGATC CGGCG ACCTT GCGGC GCTGC GCCGC GCGCC TTTGC TGGTG CCTGG GCCGG GTGGC 
CCTAG GCCGC TGGAA CGCCG CGACG CGGCG CGCGG AAACG ACCAC GGACC CGGCC CACCG 

70 80 90 100 110 120 

CAATG GTCGC AAGCA ACGGG GATGG AAACC GGCGA TGCGG GACTG TAGTC TGCGC GGATC 
GTTAC CAGCG 7TCGT TGCCC CTACC TTTGG CCGCT ACGCC CTGAC ATCAG ACGCG CCTAG 

130 140 150 160 170 180 

• m * * * * 

GCCGG TCCGG GGGAC AAGAT GAGCG CACAT GCCCT GCCCA AGGCA GATCT GACCG CCACC 
CGGCC AGGCC CCCTG TTCTA CTCGC GTGTA CGGGA CGGGT TCCGT CTAGA CTGGC GGTGG 

190 2C0 210 220 A 230 240 

AGCCT GATCG 7CTCG GGCGG CATCA TCGCC GCTTG GCTGG CCCTG CATGT GCATG CGCTG 
TCGGA CTAGC AGAGC CCGCC GTAGT AGCGG CGAAC CGACC GGGAC GTACA CGTAC GCGAC 

250 260 270 280 290 300 

TGGTT TCTGG ACGCA GCGGC GCATC CCATC CTGGC GATCG CAAAT TTCCT GGGGC TGACC 
ACCAA AGACC TGCGT CGCCG CGTAG GGTAG GACCG CTAGC GTTTA AAGGA CCCCG ACTGG 

310 320 330 340 350 360 

m « * * * * 

TGGCT GTCGG TCGGA TTGTT CATCA TCGCG CATGA CGCGA TGCAC GGGTC GGTGG TGCCG 
ACCGA CAGCC AGCCT AACAA GTAGT AGCGC GTACT GCGCT ACGTG CCCAG CCACC ACGGC 

370 3S0 390 400 410 420 

GGGCG TCCGC GCGCC AATGC GGCGA TGGGC CAGCT TGTCC TGTGG CTGTA TGCCG GATTT 
CCCGC AGGCG CGCGG TTACG CCGCT ACCCG GTCGA ACAGG ACACC GACAT ACGGC CTAAA 

430 440 450 460 470 480 

* * * * * * 

TCGTG GCGCA AGATG ATCGT CAAGC ACATG GCCCA TCACC GCCAT GCCGG AACCG ACGAC 
AGCAC CGCGT TCTAC TAGCA GTTCG TGTAC CGGGT AGTGG CGGTA CGGCC TTGGC TGCTG 

490 500 510 520 530 540 

. * * -m * * 

GACCC CGAT7 7CGAC CATGG CGGCC CGGTC CGCTG GTACG CCCGC TTCAT CGGCA CCTAT 
CTGGG GCTAA AGCTG GTACC GCCGG GCCAG GCGAC CATGC GGGCG AAGTA GCCGT GGATA 

550 560 570 530 590 600 

TTCGG CTGGC GCGAG GGGCT GCTGC TGCCC GTCAT CGTGA CGGTC TATGC GCTGA TCCTT 
AAGCC GACCG CGCTC CCCGA CGACG ACGGG CAGTA GCACT GCCAG ATACG CGACT AGGAA 

610 620 630 640 650 660 

a * » » * * 

GGGGA 7CGCT GGATG TACG7 GG7CT 7C7GG CCGC7 GCCG7 CGATC CTGGC GTCGA 7CCAG 
CCCC7 AGCGA CC7AC A7GCA CCAGA AGACC GGCGA CGGCA GC7AG GACCG CAGC7 AGGTC 
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670 630 6S0 700 710 720 

■ » • * mm 

CTGTT CGTGT TCGGC ACCTG GCTGC CGCAC CGCCC CGGCC ACGAC GCGTT CCCGG ACCGC 
GACAA GCACA AGCCG TGGAC CGACG GCGTG GCGGG GCCGG TGCTG CGCAA GGGCC TGGCG 

730 740 750 760 770 780 

* * * • * « 

CACAA TGCGC GGTCG TCGCG GATCA GCGAC CCCGT GTCGC TGCTG ACCTG CTTTC ACTTT 
GTGTT ACGCG CCAGC AGCGC CTAGT CGCTG GGGCA CAGCG ACGAC TGGAC GAAAG TGAAA 

790 800 810 820 830 840 

« m m * • • 

GGCGG TTATC ATCAC GAACA CCACC TGCAC CCGAC GGTGC CGTGG TGGCG CCTGC CCAGC 
CCGCC AATAG TAGTG CTTGT GGTGG ACGTG GGCTG CCACG GCACC ACCGC GGACG GGTCG 

850 860 G 870 880 890 900 

ACCCG CACCA AGGGG GACAC CGCAT GACCA ATTTC CTGAT CGTCG TCGCC ACCGT GCTGG 
TGGGC GTGGT TCCCC CTGTG GCGTA CTGGT TAAAG GACTA GCAGC AGCGG TGGCA CGACC 

910 920 g 930 940 950 960 

* * * * * ■* 

TGATG GAGTT GACGG CCTAT TCCGT CCACC GCTGG ATCAT GCACG GCCCC CTGGG CTGGG 
ACT AC CTCAA CTGCC GGATA AGGCA GGTGG CGACC TAGTA CGTGC CGGGG GACCC GACCC 

970 980 950 1000 1010 1020 

* * * » * . * 

GCTGG CACAA GTCCC ACCAC GAGGA ACACG ACCAC GCGCT GGAAA AGAAC GACCT GTACG 
CGACC GTGTT CAGGG TGGTG CTCCT TGTGC TGGTG CGCGA CCTTT TCTTG CTGGA CATGC 

1030 1040 1050 1060 1070 1080 

GCCTG GTCTT TGCGG TGATC GCCAC GGTGC TGTTC ACGGT GGGCT GGATC TGGGC GCCGG 
CGGAC CAGAA ACGCC ACTAG CGGTG CCACG ACAAG TGCCA CCCGA CCTAG ACCCG CGGCC 

1090 1100 1110 1120 1130 1140 

TCCTG TGGTG GATCG CCTTG GGCAT GACTG TCTAT GGGCT GATCT ATTTC GTCCT GCATG 
AGGAC ACCAC CTAGC GGAAC CCGTA CTGAC AGATA CCCGA CTAGA TAAAG CAGGA CGTAC 

1150 1160 1170 1180 1190 1200 

* * * * 

ACGGG CTGGT GCATC AGCGC TGGCC GTTCC GTTAT ATCCC GCGCA AGGGC TATGC CAGAC 
TGCCC GACCA CGTAG TCGCG ACCGG CAAGG CAATA TAGGG CGCGT TCCCG ATACG GTCTG 

1210 1220 1230 1240 12S0 1260 

* « * * * 

GCCTG TATCA GGCCC ACCGC CTGCA CCATG CGGTC GAGGG GCGCG ACCAT TGCGT CAGCT 
CGGAC ATAGT CCGGG TGGCG GACGT GGTAC GCCAG CTCCC CGCGC TGGTA ACGCA GTCGA 

1270 12S0 1290 1300 1310 1320 

• • « * 

TCGGC TTCAT CTATG CGCCC CCGGT CGACA AGCTG AAGCA GGACC TGAAG ATGTC GGGCG 
AGCCG AAGTA GAT AC GCGGG GGCCA GCTGT TCGAC TTCGT CCTGG ACTTC TACAG CCCGC 
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133 0 1340 E 13 , 50 1360 1370 1360 

TGCTG CGGGC CGAGG CGCAG GAGCG CaJgT GACCC ATGAC GTGCT GCTGG CAGGG GCGGG 
ACGAC GCCCG GCTCC GCGTC CTCGC GTGCA CTGGG TACTG CACGA CGACC GxCCC CGCCC 



1390 1400 



141oD 1420 1430 1440 



CCttG CCP'-C GGGCT GATCG CCCTG GCGCT GCGCG CGGCG CGGCC CGACC TGCGC GTGCT 
GG.Uc GGTTG CCCGA CTAGC GGGAC CGCGA CGCGC GCCGC GCCGG GCTGG ACGCG CACGA 

1450 1460 . 1470 1480 1490 1500 

GCTGC TGG^C CATGC CGCAG GACCG TCAGA CGGCC ACACC TGGTC CTGCC ACGAC CCCGA 
CGACG £cTC GTACG GCGTC CTGC-C AGTCT GCCGG TGTGG ACCAG GACGG TGCTG GGGCT 

1510 1520 1530 1540 1550 1560 

* * * * 

CC-G- CGCCG GACTG GCTGG CGCGG CTGAA GCCCC TGCGC CGCGC CAACT GGCCC GACCA 
GGACI GCGGC CTGAC CGACC GCGCC GACTT CGGGG ACGCG GCGCG GTTGA CCGGG CTGGT 

1570 1580 1590 1600 1610 1620 

— ^ ^ •* * 

GC-^C-G TGCGC TTTCC CCGCC ATGCC CGGCG GCTGG CCACC GGTTA CGGGT CGCTG GACGG 
CCTCC ACGCG AAAGG GGCGG TACGG GCCGC CGACC GGTGG CCAAT GCCCA GCGAC CTGCC 

1630 1640 1650 1660 1670 1630 

w * ' ' ' * * * 

GGCGG CGCTG GCGGA TGCGG TGGTC CGGTC GGGCG CCGAG. ATCCG CTGGG ACAGC GACAT 
CCGCC GCGAC CGCCT ACGCC ACCAG GCCAG CCCGC GGCTC TAGGC GACCC TGTCG CTGTA 

1590 1700 1710 1720 1730 1740 

CGCCC TGCTG GATGC GCAGG GGGCG ACGCT GTCCT GCGGC ACCCG GATCG AGGCG GGCGC 
GCGGG ACGAC CTACG CGTCC CCCGC TGCGA CAGGA CGCCG TGGGC CTAGC TCCGC CCGCG 

n 50 1760 1770 1780 1790 1800 

« - * * * 

GGTCC TGGAC GGGCG GGGCG CGCAG CCGTC GCGGC ATCTG ACCGT GGGTT TCCAG AAATT 

CCAGG ACCTG CCCGC CCCGC GCGTC GGCAG CGCCG TAGAC TGGCA CCCAA AGGTC TTTAA 



1810 



1320 1830 1840 1850 I860 



CGTGG GTGTC GAG AT CGAGA CCGAC CGCCC CCACG GCGTG CCCCG CCCGA TGATC ATGGA 
GCACC CACAG CTCTA GCTCT GGCTG GCGGG GGTGC CGCAC GGGGC GGGCT ACT AG TACCT 

1870 19e0 1890 1900 1910 1920 

w w » » 

CGCGA CCGTC ACCCA GCAGG ACGGG TACCG CTTCA TCTAT CTGCT GCCCT TCTCT CCGAC 
GCGCT GGCAG TGGGT CGTCC TGCCC ATGGC GAAGT AG AT A GACGA CGGGA AG AG A GGCTG 



1930 



19 40 1950 I960 1970 1980 



GCGO TCCTG ATCGA GGACA CGCGC TATTC CGATG GCGGC GATCT GGACG ACGAC GCGCT 
CGCGT AGGAC TAGCT CCTGT GCGCG ATAAG GCTAC CGCCG CTAGA CCTGC TGCTG CGCGA 

F I G. 7 



55 



BNSDOCID: <EP 0735137Al_L> 



EP 0 735 137 A1 



19S0 

GGCGG CGGCG 
CCGCC GCCGC 

2050 

CGAAC GCGGC 
GCTTG CGCCG 

2110 

CGCGG CGGGG 
GCGCC GCCCC 

2170 

m 

TTCGC TGCCC 
AAGCG ACGGG 



2000 

TCCCA CGACT 
AGGGT GCTGA 

2060 

ATCCT TCCCA 
TAGGA AGGGT 

2120 
* 

CCTGT TCCCG 
GGACA AGGGC 

2180 

TATGC GGCAC 
ATACG CCGTG 



2010 

ATGCC CGCCA 
TACGG GCGGT 

2070 
* 

TCGCG CTGGC 
AGCGC GACCG 

2130 

w 

TGGGA CTGCG 
ACCCT GACGC 

2190 

AGGTG GCGGA 
TCCAC CGCCT 



2020 

GCAGG GCTGG 
CGTCC CGACC 

2080 

m 

CCATG ATGCG 
GGTAC TACGC 

2140 

CGCGG GGTTC 
GCGCC CCAAG 

2200 

CGTGG TGGCG 
GCACC ACCGC 



2030 
* 

ACCGG GGCCG 
TGGCC CCGGC 

2090 
* 

GCGGG CTTCT 
CGCCC GAAGA 

2150 
* 

TTTCA TCCGG 
AAAGT AGGCC 

2210 

GGTCT GTCCG 
CCAGA CAGGC 



2040 

w 

AGGTC CGGCG 
TCCAG GCCGC 

2100 

*r 

GGGCC GATCA 
CCCGG CTAGT 

2160 

w 

TCACC GGCTA 
AGTGG CCGAT 

2220 

GGCCG CCCGG 
CCGGC GGGCC 



2230 2240 2250 2260 2270 2230 

CACCG ACGCG CTGCG CGGCG CCATC CGCGA TTACG CGATC GACCG GGCGC GCCGC GACCG 
GTGGC TGCGC GACGC GCCGC GGTAG GCGCT AATGC GCTAG CTGGC CCGCG CGGCG CTGGC 

2290 2300 2310 2320 2330 2340 

CTTTC TGCGC CTTTT GAACC GGATG CTGTT CCGCG GCTGC GCGCC CGACC GGCGC TATAC 
GAAAG ACGCG GAAAA CTTGG CCTAC GACAA GGCGC CGACG CGCGG GCTGG CCGCG ATATG 

2350 2360 2370 2380 2390 2400 

* * * * w * 

CC7GC TGCAG CGGTT CTACC GCATG CCGCA TGGAC TGATC GAACG GTTCT ATGCC GGCCG 
GGACG ACGTC GCCAA GATGG CGTAC GGCGT ACCTG ACT AG CTTGC CAAGA TACGG CCGGC 

2410 2420 2430 2440 2450 2460 

* * * * * 

GCTGA GCGTG GCGGA TCAGC TGCGC ATCGT GACCG GCAAG CCTCC CATTC CCCTT GGCAC 
CGACT CGCAC CGCCT AGTCG ACGCG TAGCA CTGGC CGTTC GGAGG GTAAG GGGAA CCGTG 

2470 2480 2450 2500 2510 2520 

GGCCA TCCGC TGCCT GCCCG AACGT CCCCT GCTGA AGGAA AACGC ATGAA CGCCC ATTCG 
CCGGT AGGCG ACGGA CGGGC TTGCA GGGGA CGACT TCCTT TTGCG TACTT GCGGG TAAGC 



2530 2540 2550 2560 H2570 2580 

" « « *r ' * * 

CCCGC GGCCA AGACC GCCAT CGTGA TCGGC GCAGG CTTTG GCGGG CTGGC CCTGG CCATC 
GGGCG CCGGT TCTGG CGGTA GCACT AGCCG CGTCC GAAAC CGCCC GACCG GGACC GGTAG 

2590 2600 2610 2620 2630 2540 

CGCCT GCAGT CCGCG GGCAT CGCCA CCACC CTGGT CGAGG CCCGG GACAA GCCCG GCGGG 
GCGGA CGTCA GGCGC CCGTA GCGGT GGTGG GACCA GCTCC GGGCC CTGTT CGGGC CGCCC 
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2650 



2660 2670 2680 2690 2700 



CGCGC C7A7G TCTGG CACGA 7CAGG GCCAT CTCT7 CGACG CGGGC CCGAC CGTCA TCACC 
GCGCG GA7AC AGACC GTGCT AG7CC CGGTA GAGAA GC7GC GCCCG GGCTG GCAGT AGTGG 



2710 



2720 2730 2740 2750 2760 



GACCC CGATG CGCTG AAAGA GCTGT GGGCC C7GAC CGGGC AGGAC ATGGC GCGCG ACGTG 
CTGGG GCTAC GCGAC TTTCT CGACA CCCGG GACTG GCCCG 7CCTG 7ACCG CGCGC 7GCAC 

2770 2780 2790 2800 2610 2820 

ACGC7 GA7GC CGG7C 7CGCC C77C7 A7CGG C7GA7 G7GGC CGGGC GGGAA GG7C7 7CGA7 
7GCGA C7ACG GCCAG AGCGG GAAGA 7AGCC GAC7A CACCG GCCCG CCC77 CCAGA AGC7A 



2830 



2840 2850 2860 2870 2880 



7ACG7 GAACG AGGCC GA7CC AGGG7 C7GGG 7C77G CCG7G CCAGG 7GAAG C7G77 GCCG7 
A7GCA C77GC 7CCGG C7AGG 7CCCA GACCC AGAAC GGCAC GG7CC AC77C GACAA CGGCA 



2886 
* 

GGA7C C 
CC7AG G 
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y 110 120 130 140 150 

ATGTCCGGACGGAAGCCTGGCACAACTGGCGACACGATCGTCAATCTCGGTCTGACCGCC 
1 MetSerGlyArgLysProGlyThrThrGlyAspThrlleValAsnLeuGlyLeuThrAla 

160 170 180 190 200 210 

GCGATCCTGCTGTGCTGGCTGGTCCTGCACGCCTTTACGCTATGGTTGCTAGATGCGGCC 
21 AlalleLeuLeuCysTrpLeuValLeuHisAlaPheThrLeuTrpLeuLeuAspAlaAla 

220 230 240 250 260 270 

GCGCATCCGCTGCTTGCCGTGCTGTGCCTGGCTGGGCTGACCTGGCTGTCGGTCGGGCTG 
41 AlaHisProLeuLeuAlaValLeuCysLeuAlaGlyLeuThrTrpLeuSerValGlyLeu 

280 290 300 310 320 330 

TTCATCATCGCGCATGACGCAATGCACGGGTCCGTGGTGCCGGGGCGGCCGCGCGCCAAT 
61 PhellelleAlaHisAspAlaMetHisGlySerValValProGlyArgProArgAlaAsn 

340 350 360 370 380 390 

GCGGCGATCGGGCAACTGGCGCTGTGGCTCTATGCGGGGTTCTCGTGGCCCAAGCTGATC 
81 AlaAlalleGlyGlnLeuAlaLeuTrpLeuTyrAlaGlyPheSerTrpProLysLeuIle 

400 410 420 430 440 450 

GCCAAGCACATGACGCATCACCGGCACGCCGGCACCGACAACGATCCCGATTTCGGTCAC 
101 AlaLysHisMetThrHisHisArgHisAlaGlyThrAspAsnAspProAspPheGlyHis 

460 470 480 490 500 510 

GGAGGGCCCGTGCGCTGGTACGGCAGCTTCGTCTCCACCTATTTCGGCTGGCGAGAGGGA 
121 GlyGlyProValArgTrpTyrGlySerPheValSerThrTyrPheGlyTrpArgGluGly 

520 530 540 550 560 570 

CTGCTGCTACCGGTGATCGTCACCACCTATGCGCTGATCCTGGGCGATCGCTGGATGTAT 
141 LeuLeuLeuProVallleValThrThrTyrAlaLeuIleLeuGlyAspArgTrpMetTyr 

580 590 600 610 620 630 

GTCATCTTCTGGCCGGTCCCGGCCGTTCTGGCGTCGATCCAGATTTTCGTCTTCGGAACT 
161 ValllePheTrpProValProAlaValLeuAlaSerlleGlnllePheValPheGlyThr 

640 650 660 670 680 690 

TGGCTGCCCCACCGCCCGGGACATGACGATTTTCCCGACCGGCACAACGCGAGGTCGACC 
181 TrpLeuProHisArgProGlyHisAspAspPheProAspArgHisAsnAlaArgSerThr 

700 710 720 730 740 750 

GGCATCGGCGACCCGTTGTCACTACTGACCTGCTTCCATTTCGGCGGCTATCACCACGAA 
201 GlylleGlyAspProLeuSerLeuLeuThrCysPheHisPheGlyGlyTyrHisHisGlu 
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760 



770 



780 



790 



800 



810 



CATCACCTGCATCCGCATGTGCCGTGGTGGCGCCTGCCTCGTACACGCAAGACCGGAGGC 
221 HisHisLeuHisProHisValProTrpTrpArgLeuProArgThrArgLysThrGlyGly 

820 827 
CGCGCATGA 
241 ArgAla*** 
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CI 

J 830 840 850 860 870 B80 

ATGACGCAATTCCTCATTGTCGTGGCGACAGTCCTCGTGATGGAGCTGACCGCCTATTCC 
1 MetThrGlnPheLeuIleValValAlaThrValLeuValMetGluLeuThrAlaTyrSer 

890 900 910 920 930 940 

GTCCACCGCTGGATTATGCACGGCCCCCTAGGCTGGGGCTGGCACAAGTCCCATCACGAA 
21 ValHisArgTrpIleMetHisGlyProLeuGlyTrpGlyTrpHisLysSerHisHisGlu 

950 960 970 980 990 1000 

GAGCACGACCACGCGTTGGAGAAGAACGACCTCTACGGCGTCGTCTTCGCGGTGCTGGCG 
41 GluHisAspHisAlaLeuGluLysAsnAspLeuTyrGlyValValPheAlaValLeuAla 

1010 1020 1030 1040 1050 1060 

ACGATCCTCTTCACCGTGGGCGCCTATTGGTGGCCGGTGCTGTGGTGGATCGCCCTGGGC 
61 ThrlleLeuPheThrValGlyAlaTyrTrpTrpProValLeuTrpTrpIleAlaLeuGly 

1070 1080 1090 1100 1110 1120 

ATGACGGTCTATGGGTTGATCTATTTCATCCTGCACGACGGGCTTGTGCATCAACGCTGG 
81 MetThrValTyrGlyLeuIleTyrPhelleLeuHisAspGlyLeuValHisGlnArgTrp 

1130 1140 1150 1160 1170 1180 

CCGTTTCGGTATATTCCGCGGCGGGGCTATTTCCGCAGGCTCTACCAAGCTCATCGCCTG 
101 ProPheArgTyrlleProArgArgGlyTyrPheArgArgLeuTyrGlnAlaHisArgLeu 

1190 1200 1210 1220 1230 1240 

CACCACGCGGTCGAGGGGCGGGACCACTGCGTCAGCTTCGGCTTCATCTATGCCCCACCC 
121 HisHisAlaValGluGlyArgAspHisCysValSerPheGlyPhelleTyrAlaProPro 

1250 1260 1270 1280 1290 1300 

GTGGACAAGCTGAAGCAGGATCTGAAGCGGTCGGGTGTCCTGCGCCCCCAGGACGAGCGT 
141 ValAspLysLeuLysGlnAspLeuLysArgSerGlyValLeuArgProGlnAspGluArg 

1312 
CCGTCGTGA 
161 ProSer*** 



to 

FIG. 15 



BNSDOCID: <EP 07351 37A1J_> 



63 



EP 0 735 137 A1 



10 20 30 40 50 60 

CTGCA GGCCG GGCCC GGTGG CCAAT GGTCG CAACC GGCAG GACTG GAACA GGACG GCGGG 
GACGT CCGGC CCGGG CCACC GGTTA CCAGC GTTGG CCGTC CTGAC CTTGT CCTGC CGCCC 

Al 

70 80 90 f 110 120 

CCGGT CTAGG CTGTC GCCCT ACGCA GCAGG AGTTT CGGAT GTCCG GACGG AAGCC TGGCA 
GGCCA GATCC GACAG CGGGA TGCGT CGTCC TCAAA GCCTA CAGGC CTGCC TTCGG ACCGT 

130 140 .150 160 170 180 

CAACT GGCGA CACGA TCGTC AATCT CGGTC TGACC GCCGC GATCC TGCTG TGCTG GCTGG 
GTTGA CCGCT GTGCT AGCAG TTAGA GCCAG ACTGG CGGCG CTAGG ACGAC ACGAC CGACC 

190 200 210 220 230 240 

TCCTG CACGC CTTTA CGCTA TGGTT GCTAG ATGCG GCCGC GCATC CGCTG CTTGC CGTGC 
AGGAC GTGCG GAAAT GCGAT ACCAA CGATC TACGC CGGCG CGTAG GCGAC GAACG GCACG 

250 260 270 280 290 300 

TGTGC CTGGC TGGGC TGACC TGGCT GTCGG TCGGG CTGTT CATCA TCGCG CATGA CGCAA 
ACACG GACCG ACCCG ACTGG ACCGA CAGCC AGCCC GACAA GTAGT AGCGC GTACT GCGTT 

310 320 330 340 350 360 

TGCAC GGGTC CGTGG TGCCG GGGCG GCCGC GCGCC AATGC GGCGA TCGGG CAACT GGCGC 
ACGTG CCCAG GCACC ACGGC CCCGC CGGCG CGCGG TTACG CCGCT AGCCC GTTGA CCGCG 

370 380 390 400 410 420 

TGTGG CTCTA TGCGG GGTTC TCGTG GCCCA AGCTG ATCGC CAAGC ACATG ACGCA TCACC 
ACACC GAGAT ACGCC CCAAG AGCAC CGGGT TCGAC TAGCG GTTCG TGTAC TGCGT AGTGG 

430 440 450 460 470 480 

GGCAC GCCGG CACCG ACAAC GATCC CGATT TCGGT CACGG AGGGC CCGTG CGCTG GTACG 
CCGTG CGGCC GTGGC TGTTG CTAGG GCTAA AGCCA GTGCC TCCCG GGCAC GCGAC CATGC 

490 500 510 520 530 540 

GCAGC TTCGT CTCCA CCTAT TTCGG CTGGC GAGAG GGACT GCTGC TACCG GTGAT CGTCA 
CGTCG AAGCA GAGGT GGATA AAGCC GACCG CTCTC CCTGA CGACG ATGGC CACTA GCAGT 

550 560 570 580 590 600 

CCACC TATGC GCTGA TCCTG GGCGA TCGCT GGATG TATGT CATCT TCTGG CCGGT CCCGG 
GGTGG ATACG CGACT AGGAC CCGCT AGCGA CCTAC ATACA GTAGA AGACC GGCCA GGGCC 

610 620 630 640 650 660 

CCGTT CTGGC GTCGA TCCAG ATTTT CGTCT TCGGA ACTTG GCTGC CCCAC CGCCC GGGAC 
GGCAA GACCG CAGCT AGGTC TAAAA GCAGA AGCCT TGAAC CGACG GGGTG GCGGG CCCTG 

670 680 690 700 710 720 

ATGAC GATTT TCCCG ACCGG CACAA CGCGA GG.TCG ACCGG CATCG GCGAC CCGTT GTCAC 
TACTG CTAAA AGGGC TGGCC GTGTT GCGCT CCAGC TGGCC GTAGC CGCTG GGCAA CAGTG 
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730 740 750 760 770 780 

TACTG ACCTG CTTCC ATTTC GGCGG CTATC ACCAC GAACA TCACC TGCAT CCGCA TGTGC 
ATGAC TGGAC GAAGG TAAAG CCGCC GATAG TGGTG CTTGT AGTGG ACGTA GGCGT ACACG 

CI 

790 800 810 820 J 830 840 

CGTGG TGGCG CCTGC CTGGT ACACG CAAGA CCGGA GGCCG CGCAT GACGC AATTC CTCAT 
GCACC ACCGC GGACG GAGCA TGTGC GTTCT GGCCT CCGGC GCGTA CTGCG TTAAG GAGTA 



850 860 870 880 |b 890 900 

TGTCG TGGCG ACAGT CCTCG TGATG GAGCT GACCG CCTAT TCCGT CCACC GCTGG AT TAT 
ACAGC ACCGC TGTCA GGAGC ACTAC CTCGA CTGGC GGATA AGGCA GGTGG CGACC TAATA 

910 920 930 940 950 960 

GCACG GCCCC CTAGG CTGGG GCTGG CACAA GTCCC ATCAC GAAGA GCACG ACCAC GCGTT 
CGTGC CGGGG GATCC GACCC CGACC GTGTT CAGGG TAGTG CTTCT CGTGC TGGTG CGCAA 

970 980 990 1000 1010 1020 

GGAGA AGAAC GACCT CTACG GCGTC GTCTT CGCGG TGCTG GCGAC GATCC TCTTC ACCGT 
CCTCT TCTTG CTGGA GATGC CGCAG CAGAA GCGCC ACGAC CGCTG CTAGG AGAAG TGGCA 

1030 1040 1050 1060 1070 1080 

GGGCG CCTAT TGGTG GCCGG TGCTG TGGTG GATCG CCCTG GGCAT GACGG TCTAT GGGTT 
CCCGC GGATA ACCAC CGGCC ACGAC ACCAC CTAGC GGGAC CCGTA CTGCC AGATA CCCAA 

1090 1100 1110 1120 1130 1140 

GATCT ATTTC ATCCT GCACG ACGGG CTTGT GCATC AACGC TGGCC GTTTC GGTAT ATTCC 
CTAGA TAAAG TAGGA CGTGC TGCCC GAACA CGTAG TTGCG ACCGG CAAAG CCATA TAAGG 

1150 1160 1170 1180 ' 1190 1200 

GCGGC GGGGC TATTT CCGCA GGCTC TACCA AGCTC ATCGC CTGCA CCACG CGGTC GAGGG 
CGCCG CCCCG AT AAA GGCGT CCGAG ATGGT TCGAG TAGCG GACGT GGTGC GCCAG CTCCC 

1210 1220 1230 1240 1250 1260 

GCGGG ACCAC TGCGT CAGCT TCGGC TTCAT CTATG CCCCA CCCGT GGACA AGCTG AAGCA 
CGCCC TGGTG ACGCA GTCGA AGCCG AAGTA GAT AC GGGGT GGGCA CCTGT TCGAC TTCGT 

1270 1280 1290 1300 1310 1320 

GGATC TGAAG CGGTC GGGTG TCCTG CGCCC CCAGG ACGAG CGTCC GTCGT GATCT CTGAT 
CCTAG ACTTC GCCAG CCCAC AGGAC GCGGG GGTCC TGCTC GCAGG CAGCA CTAGA GACTA 

iP 

1330 1340 1350 1360 13713 1380 

CCCGG CGTGG CCGCA TGAAA TCCGA CGTGC TGCTG GCAGG GGCCG GCCTT GCCAA CGGAC 

GGGCC GCACC GGCGT ACTTT AGGCT GCACG ACGAC CGTCC CCGGC CGGAA CGGTT GCCTG 

1390 1400 1410 1420 1430 1440 

TGATC GCGCT GGCGA TCCGC AAGGC GCGGC CCGAC CTTCG CGTGC TGCTG CTGGA CCGTG 
ACTAG CGCGA CCGCT AGGCG TTCCG CGCCG GGCTG GAAGC GCACG ACGAC GACCT GGCAC 
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1450 1460 1470 
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